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EPITOPE SEQUENCES 

Field of the Invention 

The present invention generally relates to peptides, and nucleic acids encoding peptides, 
that are uscM epitopes of target-associated antigens. More specifically, the invention relates to 
5 epitopes that have a high affinity for MHC class I and that are produced by target-specific 

proteasomes. 

Description of the Related Art 

Neo plasia and the Immune System 

The neoplastic disease state commonly known as cancer is thought to result generally fi-om 
10 a single ceil growing out of control. The uncontrolled growth state typically results firom a multi- 

step process in which a series of cellular systems fail, resulting in the genesis of a neoplastic cell. 
The resultmg neoplastic cell rapidly reproduces itself, forms one or more tumors, and eventually 
may cause the death of the host. 

Because the progenitor of the neoplastic cell shares the host's genetic material, neoplastic 
1 5 cells are largely unassailed by the host's immune system. During immune surveillance, the process 

in which the host's immune system surveys and localizes foreign materials, a neoplastic cell will 
appear to the host's immune surveillance machinery as a "self' cell. 
Viruses and the Immune System 

Li contrast to cancer cells, virus infection involves the expression of clearly non-self 

20 antigens. As a result, many virus infections are successfully dealt with by the immune system with 

minimal clinical sequela. Moreover, it has been possible to develop efiFective vaccines for many of 
those infections that do cause serious disease. A variety of vaccine approaches have been used 
successfiilly to combat various diseases. These approaches include subunit vaccines consisting of 
individual proteins produced through recombinant DNA technology. Notwithstanding these 

25 advances, the selection and effective administration of minimal epitopes for use as viral vaccines 

has remained problematic. 

In addition to the difficulties involved in epitope selection stands the problem of viruses 
that have evolved the capability of evading a host's immune system. Many viruses, especially 
viruses that establish persistent infections, such as members of the herpes and pox virus femilies, 

30 produce immunomodulatory molecules fliat permit the virus to evade the host's immune systenL 

The effects of these immunomodulatory molecules on antigen presentation may be overcome by 
the targeting of select epitopes for administration as immunogenic compositions. To better 
understand the interaction of neoplastic cells and virally infected cells with the host's immune 
S3^tem, a discussion of the system's components follows below. 

35 The immune system functions to discriminate molecules endogenous to an organism 

("self molecules) from material exogenous or foreign to the organism C*non-self' molecules). The 
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immune system has two types of adaptive responses to foreign bodies based on the components that 
mediate the response: a humoral response and a cell-mediated response. The humoral response is 
mediated by antibodies, while the cell-mediated response involves cells classified as lymphocytes. 
Recent anticancer and antiviral strategies have focused on mobilizing the host immune system as a 

5 means of anticancer or antiviral treatment or therapy. 

The immune system functions in tiiree phases to protect the host from foreign bodies: the 
cognitive phase, the activation phase, and the effector phase. In the cognitive phase, the immune 
system recognizes and signals the presence of a foreign antigen or invader in the body. The foreign 
antigen can be, for exanqjle, a cell surface marker from a neoplastic cell or a viral protein. Once the 

10 system is aware of an invading body, antigrai specific cells of the immune system proliferate and 

differentiate in response to (he invader-triggered signals. The last stage is the effector sta^ in which 
the effector cells of the immune system respond to and neutralize the detected invada:. 

An array of effector cells inq)lements an immune response to an invader. One type of 
effector cell, the B cell, generates antibodies targeted against foreign antigens encountered by the 

15 host In combination with the conq)lement system, antibodies direct the destruction of cells or 

organisms bearing the targeted antigen. Another type of effector cell is the natural killo- cell (NK 
cell), a type of lynq)hocyte having the capacity to spontaneously recognize and destroy a variety of 
virus infected cells as well as malignant cell types. The method used by NK cells to recognize target 
cells is poorly understood. 

20 Another type of effector cell, the T cell, has members classified into three subcategories, each 

playing a different role in the immune response. Helper T cells secrete (^okines which stimulate tiie 
proliferation of other cells necessary for mounting an effective immune response, y/bUe suppressor T 
cells down-regulate the immune response, A third category of T cell, the cytotoxic T cell (CTL), is 
capable of directly lysing a targeted cell presenting a foreign antigen on its surface. 

25 The Major Histocompatibilitv Complex and T Cell Target Recognition 

T cells are antigen-specific immune cells that function in response to specific antigen signals. 
B lymphocytes and the antibodies they produce are also antigen-specific entities. However, unlike B 
lymphocytes, T cells do not respond to antigens in a free or soluble form. For a T cell to respond to 
an antigen, it requires the antigen to be processed to peptides which are then bound to a presaiting 

30 structure encoded in the major histocompatibihty complex (MHC). This requirement is called *'MHC 

restriction" and it is the mechanism by which T cells differentiate "self* from '*non-self ' cells. If an 
antigen is not displayed by a recognizable MHC molecule, the T cell will not recognize and act on the 
antigen signal. T cells specific for a peptide bound to a recognizable MHC molecule bind to these 
MHC-peptide complexes and proceed to the next stages of the immune response. 

35 There are two types of MHC, class I MHC and class n MHC. T Helper cells (CD4^ 

predominately interact with class II MHC proteins while cytolytic T cells (CD8^ predominately 
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interact with class I MHC proteins. Both classes of MHC protein are transmembrane proteins with a 
majority of their structure on the external surface of the cell. Additionally, both classes of MHC 
proteins have a peptide binding cleft on Aeir external portions. It is in this cleft that small fragmoits 
of proteins, endogenous or foreign, are bound and presented to the ^dracellular enviromnent 
5 Cells called •'professional antigen presenting cells" (pAPCs) display antigens to T cells using 

the MHC proteins but additionally e^qness various co-stimulatory molecules depending on the 
particular state of differentiation/activation of the pAPC. When T cells, specific for the peptide bound 
to a recognizable MHC protein, bind to these MHC-peptide complexes on pAPCs, the specific co- 
stimulatory molecules that act upon the T cell direct the path of differcntiation/actiYation taken by the 

10 T cell That is, the co-stimulation molecules affect how the T cell will act on antigenic signals in 

future encounters as it proceeds to the next stages of the immune response. 

As discussed above, neoplastic cells are largely ignored by the immune system. A great deal 
of effort is now being expended in an atten^t to harness a host's immune system to aid in combating 
the presence of neoplastic cells in a host One such area of research involves the formulation of 

1 S anticancer vaccines. 

Anticancer Vaccines 

Among the various weapons available to an oncologist in the battle against cancer is the 
immune system of the patient Work has been done in various attempts to cause the immune 
system to combat cancer or neoplastic diseases. Unfortunately, the results to date have been 
20 largely disappointing. One area of particular interest involves the generation and use of anticancer 

vaccines. 

To generate a vaccine or other immunogenic composition, it is necessary to introduce to a 
subject an antigen or qsitope against which an immune response may be mounted. Although 
neoplastic cells are derived from and therefore are substantially identical to normal cells on a 

25 genetic level, many neoplastic cells are known to present tumor-associated antigens (TuAAs). In 
theory, these antigens could be used by a subject's immune system to recognize these antigens and 
attack the neoplastic cells, hi reality, however, neoplastic cells generally appear to be ignored by 
the host's immune system. 

A number of differait strategies have been developed in an attempt to generate vaccines 

30 with activity against neoplastic cells. These strategies include the use of tumor-associated antigens 
as iramunogens. For example, U.S. Patent No. 5,993,828, describes a method for producing an 
immune response against a particular subunit of the Urinary Tumor Associated Antigen by 
administering to a subject an effective dose of a composition comprising inactivated tumor cells 
having the Urinary Tumor Associated Antigen on the cell sur&ce and at least one tumor associated 

35 antigen selected from the group consisting of GM-2, GD-2, Fetal Antigen and Melanoma 
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Associated Antigen. Accordingly, this patent describes using whole, inactivated tumor cells as the 
immunogen in an anticancer vaccine. 

Another strategy used with anticancer vaccines involves administering a conq)osition 
containing isolated tumor antigens. In one ^proach, MAGE-Al antigenic peptides were used as 
5 an immunogen- (See Chaux, P., et al, "Identification of Five MAGE-Al Epitopes Recognized by 
Cytolytic T Lymphocytes Obtained by In Vitro Stimulation wifli Dendritic Cells Transduced with 
MAGE-Al," J. Immunol,, 163(5):2928-2936 (1999)). There have been several therapeutic trials 
using MAGE-Al peptides for vaccination, although the effectiveness of the vaccination regimes 
was limited. The results of some of these trials are discussed in Vose, J.M., *Tumor Antigens 

10 Recognized by T Lymphocytes," 10^ European Cancer Conference, Day 2. Sept. 14, 1999. 

In another example of tumor associated antigens used as vaccines, Scheinberg, et al. 
treated 12 chronic myelogenous leukemia (CML) patients already receiving interferon (IFN) or 
hydroxyurea with 5 injections of class I-associatcd bcr-abl peptides with a helper peptide plus the 
adjuvant QS-2L Scheinberg, D.A., et al, '*BCR-ABL Breakpoint Derived Oncogene Fusion 

15 Peptide Vaccines Generate Specific Immune Responses in Patients with Chronic Myelogenous 
Leukemia (CML) [Abstract 1665], American Society of Clinical Oncology 35* Annual Meeting, 
Atlanta (1999). Proliferative and delayed type hypersensitivity (DTH) T cell responses indicative 
of T-helper activity were elicited, but no cytolytic killer T cell activity was observed within the 
fresh blood samples. 

20 Additional examples of attempts to identify TuAAs for use as vaccines are seen in the 

recent work of Cebon, et al and Scheibenbogen, et al Cebon, et al immunized patients with 
metastatic melanoma using intradermallly administered MART-I26.3S peptide with IL-12 in 
increasing doses given either subcutaneously or intravenously. Of the first 15 patients, 1 con^lete 
remission, 1 partial remission, and 1 mixed response were noted. Ibnmunc assays for T cell 

25 generation included DTH, which was seen in patients with or without IL-12. Positive CTL assays 
were seen in patients with evidence of clinical benefit, but not in patients without tumor regression. 
Cebon, et al, "Phase I Studies of hnmunization witii Melan-A and IL-12 in HLA A2+ Positive 
Patients with Stage HI and IV Malignant Melanoma," [Abstract 1671], American Society of 
Clinical Oncology 35* Annual Meeting, Atlanta (1999), 

30 Scheibenbogen, et al immunized 18 patients with 4 HLA class I restricted tyrosinase 

peptides, 16 with metastatic melanoma and 2 adjuvant patients. Scheibenbog^ et al, 
"Vaccination with Tyrosinase peptides and GM-CSF in Metastatic Melanoma: a Phase II Trial," 
[Abstract 1680], American Society of Clmical Oncology 35* Annual Meeting, Atlanta (1999). 
Increased CTL activity was observed in 4/15 patients, 2 adjuvant patients, and 2 patients with 

35 evidence of tumor regression. As in the trial by Cebon, et al, patients with progressive disease did 
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not show boosted inmiumty. In spite of the various efforts expended to date to generate efficacious 
anticancer vaccines, no such con:q}Osition has yet been developed. 
Antiviral Vaccines 

Vaccine strategies to protect against viral diseases have had many successes. Perhaps the 
most notable of these is the progress that has been made against the disease small pox, which has 
been driven to extinction. The success of the polio vaccine is of a similar magnitude. 

Viral vaccines can be grouped into three classifications: live attenuated virus vaccines, 
such as vaccinia for small pox, the Sabin poliovirus vaccine, and measles mumps and rubella; 
whole killed or inactivated virus vaccines, such as the Salk poliovirus vaccine, hepatitis A virus 
vaccine and the typical influenza virus vaccines; and subunit vaccines, such as hepatitis B. Due to 
their lack of a complete viral genome, subunit vaccines offer a greater degree of safety than those 
based on whole viruses. 

The paradigm of a successful subimit vaccine is the recombinant hepatitis B vaccine based 
on the viruses envelope protein. Despite much academic interest in pushing the reductionist 
subunit concept beyond single proteins to individual epitopes, the efforts have yet to bear much 
&uit. Viral vaccine research has also concaitrated on the induction of an antibody response 
although cellular responses also occur. However, many of the subunit formulations are particularly 
poor at generating a CTL response. 

Summary of the Invention 

Previous methods of priming professional antigen presenting cells (pAPCs) to display 
target cell epitopes have relied simply on causing tbe pAPCs to express target-associated antigens 
(TAAs), or epitopes of those antigens which are thought to have a high affinity for MHC I 
molecules. However, the proteasomal processing of such antigens results in presentation of 
epitopes on tiie pAPC that do not correspond to the epitopes present on the target cells. 

Using the knowledge that an effective cellular immune response requires that pAPCs 
present the same epitope that is presented by the target cells, the present invention provides 
epitopes that have a high affinity for MHC I, and that correspond to the processing specificity of 
the housekeeping proteasome, which is active in peripheral cells. These epitopes thus correspond 
to those presented on target cells. The use of such epitopes in vaccines can activate the cellular 
immune response to recognize the correctly processed TAA and can result in removal of target 
cells that present such epitopes. In some embodiments, the housekeeping epitopes provided herein 
can be used in combination with immime epitopes, generating a cellular immune response that is 
competent to attack target cells both before and after interferon induction. Li other embodiments 
the epitopes are useful in the diagnosis and monitoring of the target-associated disease and in the 
generation of immunological reagents for such purposes. 
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Embodiments of the invention relate to isolated epitopes and antigens or polypeptides that , 
comprise the epitopes. Preferred embodiments include an epitope or antigen having the sequence 
as disclosed in TABLE 1. Other embodiments can include an epitope cluster comprising a 
polypeptide from Table 1. Further, emodiments include a polypeptide having substantial similarity 
5 to the aheady mentioned epitopes, antigens, or clusters. Odier preferred embodiments include a 
polypeptide having functional similarity to any of the above. Still further embodiments relate to a 
nucleic acid encoding the polypeptide of any of the epitopes, clusters, antigens, and polypeptides 
from Table 1 and mentioned herein. 

The epitope can be immunologically active. The polypeptide comprising the epitope can 
10 be less than about 30 amino acids in length, more preferably, the polypeptide is 8 to 10 amino acids 

in length, for example. Substantial or functional similarity can include addition of at least one 
amino acid, for example, and the at least one additional amino acid can be at N-tenninus of the 
polypeptide. The substantial or functional similarity can include a substitution of at least one 
amino acid. 

15 The epitope, cluster, or polypeptide comprising the same can have affinity to an HLA-A2 

molecule. The affinity can be determined by an assay of binding, by an assay of restriction of 
epitope recognition, by a prediction algorithm, and the like. The epitope, cluster, or polypeptide 
comprising the same can have affinity to an HLA-B7, HLA-B5 1 molecule, and the like. 

In preferred embodiments the polypeptide can be a housekeeping epitope. The epitope or 

20 polypeptide can correspond to an epitope displayed on a tumor cell, to an epitope displayed on a 

neovasculature cell, and the like. The epitope or polypeptide can be an immune epitope. The 
epitope, cluster and/or polypeptide can be a nucleic acid. 

Other embodiments relate to pharmaceutical compositions con^rising the polypq}tides, 
including an epitope from Table I, a cluster, or a polypeptide comprising the same and a 

25 pharmaceutically acceptable adjuvant, carrier, diluent, excipient, and the like. The adjuvant can be 

a polynucleotide. The polynucleotide can include a dinucleotide. The dinucleotide can be CpG, 
for example. The adjuvant can be encoded by a polynucleotide. The adjuvant can be a cytokine 
and the cytokine can be, for example, GM-CSF. 

The pharmaceutical compositions can further include a professional antigen-presenting cell 

30 (pAPC). ThepAFC can be a dendritic cell, for example. The pharmaceutical composition can 

fiirther include a second epitope. The second epitope can be a polypeptide. The second epitope 
can be a nucleic acid. The second epitope can be a housekeeping epitope, an immune epitope, and 
the like. 

Still further embodiments relate to pharmaceutical con^sitions that include any of the 
35 nucleic acids discussed herein, including those that encode polypeptides that comprise epitopes or 
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antigens from Table 1. Such compositions can include a phamiaceutically acceptable adjuvant, « 
carrier, diluent, excipient, and HhG like. 

Other embodiments relate to recombinant constructs that include such a nucleic acid as 
described herein, including those that encode polypeptides that comprise epitopes or antigens from 
5 Table 1. The constructs can further include a plasmid, a viral vector, an artificial chromosome, and 

the like. The construct can further include a sequence encoding at least one feature, such as for 
example, a second epitope, an IRES, an ESS, an NIS, ubiquitin. 

Further embodiments relate to purified antibodies that specifically bind to at least one of 
the epitopes in Table 1 A. Other embodiments relate to purified antibodies that specifically bind to 
10 a peptide-MHC protein complex comprising an epitope disclosed in Table lA or any other suitable 

epitope. The antibody from any embodiment can be a monoclonal antibody. 

Still other embodiments relate to multimeric MHC-peptide con:q}lexes that include an 
epitope, such as, for example, an epitope disclosed in Table 1. 

Embodiments relate to isolated T cells expressing a T cell receptor specific for an MHC- 
15 peptide complex. The complex can include an epitope, such as, for example, an epitope disclosed 

in Table 1 of claim 1. The T cell can be produced by an in yntro immimization. The T cell can be 
isolated from an inmiunized animal. Embodiments relate to T cell clones, including cloned T cells, 
such as those discussed above. Embodiments also relate to polyclonal population of T cells. Such 
populations can include a T cell, as described above, for example. 
20 Still further embodiments relate to pharmaceutical con^sitions that include a T cell, such 

as those described above, for example, and a pharmaceutically acceptable adjuvant, carrier, diluent, 
excipient, and the like. 

Embodiments of the invention relate to isolated protein molecules comprising the binding 
domain of a T cell receptor specific for an MHC-peptide contplex. The complex can include an 

25 epitope disclosed in Table 1 . The protein can be multivalent. Other embodiments relate to isolated 

nucleic acids encoding such proteins. Still fiirther embodiments relate to recombinant constructs 
that include such nucleic acids. 

Other embodiments of the invention relate to host cells expressing the recombinant 
constmct described herein, including constructs encoding an epitope, cluster or polypeptide 

30 comprising the same, disclosed in Table 1, for example. The host cell can be a dendritic cell, 

macrophage, tumor cell, tumor-derived cell, and the like. The host cell can be a bacterium, fungus, 
protozoan and the like. Exnbodiments also relate to pharmaceutical compositions that include a 
host cell, such as those discussed herein, and a pharmaceutically acceptable adjuvant, canier, 
diluent, excipient, and the like. 

35 Still other embodiments relate to vaccines or immunotherapeutic compositions that include 

at least one component, such as, for example, an epitope disclosed in Table 1 or otherwise 
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described herein; a cluster that includes such an epitope, an antigen or polypeptide that includes 
such an epitope; a composition described above and herein; a construct, a T cell, or a host cell as 
described above and herein. 

Further embodiments relate to methods of treating an animal. The methods can include 
administering to an animal a vaccine or immunotherapeutic composition, including those disclosed 
above and herein. The administering step can include a mode of delivezy, such as, for example, 
transdermal, intranodal, perinodal, oral, intravenous, intradermal, intramuscular, intraperitoneal, 
mucosal, aerosol inhalation, instillation, and the like. The method can further include a step of 
assaying to determine a characteristic indicative of a state of a target cell or target cells. The 
method can include a first assaying step and a second assaying step, wherein the first assaying step 
precedes tiie administering step, and wherein the second assaying step follows the administering 
step. The method can fiurther include a step of comparing the characteristic determined in the first 
assaying step with the characteristic determined in the second assaying step to obtain a insult. The 
result can be for example, evidence of an immune response, a diminution in number of target cells, 
a loss of mass or size of a tumor comprising target cells, a decrease in number or concentration of 
an intracellular parasite infecting target cells, and the like. 

Embodiments relate to methods of evaluating immunogenicity of a vaccine or 
immunotherapeutic composition. The methods can include administering to an animal a vaccine or 
immunotherapeutic, such as those described above and elsewhere herein, and evaluating 
immunogenicity based on a characteristic of the animal. The animal can be HLA-transgenic. 

Other embodiments relate to methods of evaluating immunogenicity that include in vitro 
stimulation of a T cell with the vaccine or immunotherapeutic composition, such as those described 
above and elsewhere herein, and evaluating immunogenicity based on a characteristic of the T cell. 
The stimulation can be a primary stimulation. 

Still further embodiments relate to methods of making a passive/adoptive 
immunotherapeutic. The metfiods can include combining a T cell or a host cell, such as tiiose 
described above and elsewhere herein, with a phaimaceutically acceptable adjuvant, carrier, 
diluent, excipient, and the like. 

Other embodiments relate to methods of detemiining specific T cell frequency, and can 
include the step of contacting T cells with a MHC-peptide complex comprising an epitope 
disclosed in Table 1, or a complex comprising a cluster or antigen comprising such an epitope. The 
contacting step can include at least one feature, such as, for exanQ)le, immunization, restimulation, 
detection, enumeration, and the like. The method can furflier include ELISPOT analysis, limiting 
dilution analysis, flow cytometry, in situ hybridization, the polymerase chain reaction, any 
combination thereof and the like. 
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Embodiments relate to methods of evaluating immunologic response. The methods can 
include the above-described methods determining specific T cell frequency carried out prior to and 
subsequent to an immunization step. 

Another embodiment relates to methods of evaluating immunologic response. The 
methods can include determining frequency, cytokine production, or cytolytic activity of T cells, 
prior to and subsequent to a step of stimulation with MHC-peptide complexes con^irising an 
epitope, such as, for example an epitope from Table 1, a cluster or a polypepti<k comprising such 
an epitope. 

Further embodiments relate to methods of diagnosing a disease. The methods can include 
contacting a subject tissue with at least one component, including, for example, a T cell, a host cell, 
an antibody, a protein, including those described above and elsewhere herein; and diagnosing the 
disease based on a characteristic of the tissue or of the component The contacting step can take 
place in vivo. The contacting step can take place in vitro. 

Still olber embodiments relate to methods of making a vaccine. The methods can include 
combining at least one component, an epitope, a composition, a construct, a T cell, a host cell; 
including any of those described above and elsewhere herein, with a phannaceutically acceptable 
adjuvant, carrier, diluent, excipient, and the like. 

Embodiments relate to computer readable media having recorded thereon the sequence of 
any one of SEQ JD NOS: 1 -602, in a machine having a hardware or software that calculates the 
physical, biochemical, immunologic, or molecular genetic properties of a molecule embodying said 
sequence. 

Still other embodiments relate to metiiods of treating an anhnal. The methods can include 
combining the mefhod of treating an animal that includes administering to the animal a vaccine or 
immunotherapeutic composition, such as described above and elsewhere herein, combined witii at 
least one mode of treatment, including, for example, radiation therapy, chemotherapy, 
biochemotherapy, surgery, and the like. 

Further embodiments relate to isolated polypeptides that include an epitope cluster from a 
target-associated antigen having the sequence as disclosed in any one of Tables 25-44, wiierein the 
amino acid sequence includes not more than about 80% of the amino acid sequence of the antigen. 

Other embodiments relate to vaccines or immunotherapeutic products tiiat include an 
isolated peptide as described above and elsewhere herein. Still other embodiments relate to 
isolated polynucleotides encoding a polypeptide as described above and elsewhere herein. Other 
embodimaits relate vaccines or immunotherapeutic products that include these polynucleotides. 
The polynucleotide can be DNA or RNA. 
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Brief Description of the Drawings 

Figure 1 is a sequence alignnient of NY-ESO-1 and several similar protein sequences. 

Figure 2 graphically represents a plasnrid vaccine backbone useful for delivering nucleic 
acid-encoded epitopes. 

Figures 3A and 3B are FACS profiles showii^ results of HLA-A2 binding assays for 
tyrosinase207-2i5 and tyrosinaseaog-zid- 

Figure 4 is a T=120 min. time point mass spectrum of the firagments produced by 
proteasomal cleavage of SSX-23]^8- 

Figure 5 shows a binding curve for HLA-A2:SSX-24i^9 with controls. 

Figure 6 shows specific lysis of SSX-24Mir13ulsed targets by CTL from SSX-24Mg- 
immunized HLA-A2 transgenic mice. 

Figure 7 A, B, and C show results of N-terminal pool sequencing of a T=60 min. time point 
aliquot of the PSMAi«3.i92 proteasomal digest 

Figure 8 shows binding curves for HLA-A2:PSMAi68-i77 and HLA-A2:PSMA288-297 with 
controls. 

Figure 9 shows results of N-terminal pool sequencing of a T«=6G min. time point aliquot of 
the PSMA281-310 proteasomal digest. 

Figure 10 shows binding curves for HLA-A2:PSMA46i^, HLA-A2:PSMA4«m69, and 
HLA-A2:PSMA663.e7i. with controls. 

Figure 11 shows the results of a y-IFN-based ELISPOT assay detecting PSMAjss^ti- 
reactive HLA-Al"" CDS*" T cells. 

Figure 12 shows blocking of reactivity of the T cells used in figure 10 by anti-HLA-Al 
mAb, demonstrating HLA-Al-restricted recognition. 

Figure 13 shows a binding curve for HLA-A2:PSMA663^i, with controls. 

Figure 14 shows a binding curve for HLA-A2:PSMA662-<7i, with controls. 

Figure 15. Comparison of anti-peptide CTL responses following immunization with 
various doses of DNA by different routes of injection. 

Figure 16. Growth of transplanted gp33 expressing tumor in mice inamunized by i.bu 
injection of gp33 epitope-expressing, or control, plasmid. 

Figure 17. Amount of plasmid DNA detected by real-time PCR in injected or draining 
lymph nodes at various times after i.hi. of i.m. injection, respectively. 

Detailed Description of the Preferred Embodiment 

Definitions 

Unless otherwise clear from the context of the use of a term herein, the following listed 
terms shall generally have the indicated meanings for purposes of this description. 
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PROFESSIONAL ANnGEN-PRESENTING CELL (pAPC) - a cell that possesses T cell 
costimulatory molecules and is able to induce a T cell response. Well characterized pAPCs include 
dendritic cells, B cells, and macrophages. 

PERIPHERAL CELL - a cell that is not a pAPC. 

HOUSEKEEPING PROTEASOME - a proteasome normaUy active in peripheral cells, and 
generally not present or not strongly active in pAPCs. 

IMMUNE PROTEASOME - a proteasome normally active in pAPCs; &e immune 
proteasome is also active in some peripheral cells in infected tissues. 

EPITOPE - a molecule or substance capable of stimulating an immune response. In 
preferred embodiments, epitopes according to this definition include but are not necessarily limited 
to a polypeptide and a nucleic acid encoding a polypeptide, wherein the polypeptide is capable of 
stimulating an immime response. In other preferred embodiments, epitopes according to this 
definition include but are not necessarily limited to peptides presented on the surface of cells, the 
peptides being non-covalently bound to the binding cleft of class I MHC, such that they can 
interact with T cell receptors. 

MHC EPrrOPE - a polypeptide having a known or predicted binding affinity for a 
mammalian class I or class n major histocompatibility complex (MHC) molecule. 

HOUSEKEEPING EPITOPE ~ In a preferred embodiment, a housekeeping epitope is 
defined as a polypeptide firagment that is an MHC epitope, and that is displayed on a cell in which 
housekeeping proteasomes are predominantly active. In another preferred embodiment, a 
housekeeping epitope is defmed as a polypeptide containing a housekeeping epitope according to 
the foregoing definition, that is flanked by one to several additional amino acids. In another 
prefenred embodiment, a housekeeping epitope is defined as a nucleic acid that encodes a 
housekeeping epitope accordmg to ihc foregoing definitions. 

IMMUNE EPITOPE - In a preferred embodiment, an immune epitope is defined as a 
polypeptide fragment that is an MHC epitope, and that is displayed on a cell in which immune 
proteasomes are predominantly active. In another preferred embodiment, an immune epitope is 
defined as a polypeptide containing an immune epitope according to the foregoing definition, that 
is flanked by one to several additional amino acids, hi another preferred embodiment, an immune 
epitope is defined as a polypeptide including an epitope cluster sequence, having at least two 
polypeptide sequences having a known or predicted affmity for a class I MHC. In yet another 
preferred embodiment, an immune epitope is defined as a nucleic acid that encodes an immune 
epitope according to any of the foregoing definitions. 

TARGET CELL - a cell to be targeted by the vaccines and methods of the invention. 
Examples of target cells according to tiiis definition include but arc not necessarily limited to: a 
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neoplastic cell and a cell harboring an intracellular parasite, such as, for example, a virus, a 
bacterium, or a protozoan. 

TARGET-ASSOCIATED ANTIGEN (TAA) - a protein or polypeptide present in a target 

cell. 

TUMOR-ASSOCIATED ANTIGENS (TuAA) - a TAA, wherein the target cell is a 
neoplastic cell. 

HLA EPrrOPE - a polypeptide having a known or predicted binding affinity for a human 
class I or class n HLA complex molecule. 

ANTIBODY - a natural immunoglobulin (Ig), poly- or monoclonal, or any molecule 
composed in whole or in part of an Ig binding domain, whether derived biochemically or by use of 
recombinant DNA. Exan5)les include inter alia, F(ab), single chain Fv, and Ig variable region- 
phage coat protein fusions. 

ENCODE - an open-ended tenn such that a nucleic acid encoding a particular amino acid 
sequence can consist of codons specifying that (poly)peptide, but can also comprise additional 
sequences either translatable, or for ±c control of transcription, translation, or replication, or to 
&cilitate manipulation of some host nucleic acid construct. 

SUBSTANTIAL SIMBLARTTY - this term is used to refer to sequences that differ from a 
reference sequence in an inconsequential way as judged by examination of the sequence. Nucleic 
acid sequences encoding the same amino acid sequence are substantially similar despite differences 
in degenerate positions or modest differences in length or composition of any non-coding regions. 
Amino acid sequences differing only by conservative substitution or minor lengdi variations are 
substantially similar. Additionally, amino acid sequences comprising housekeeping epitopes that 
differ in the number of N-terminal flanking residues, or immune epitopes and epitope clusters that 
differ in the number of flanking residues at either terminus, are substantially similar. Nucleic acids 
that encode substantially similar amino acid sequences are themselves also substantially similar. 

FUNCTIONAL SBVULARITY - this term is used to refer to sequences that differ from a 
reference sequence in an inconsequential way as judged by examination of a biological or 
biochemical property, although the sequences may not be substantially similar. For example, two 
nucleic acids can be useful as hybridization probes for the same sequence but encode differing 
amino acid sequences. Two peptides that induce cross-reactive CTL responses are functionally 
similar even if they differ by non-conservative amino acid substitutions (and thus do not meet the 
substantial similarity definition). Pairs of antibodies, or TCRs, that recognize the same epitope can 
be functionally similar to each other despite whatever structural differences exist. In testing for 
functional similarity of immunogenicity one would generally immunize vnth the "altered" antigen 
and test the ability of the elicited response (Ab, CTL, cytokine production, etc.) to recognize the 
target antigen. Accordingly, two sequences may be designed to differ in certain reacts while 
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retaining the S3nie function. Such designed sequence variants are among the embodiments of &e 
present invention. 

Table lA. S£Q ID NOS * including epitopes in Examples 1-7, 13. 



SEQIDNO 






I 


lyr 207-216 


FLPWHRLFLL 


2 


Tyrosinase protein 


Accession number**: P14679 


3 


SSX-2 protein 


Accession number NP 003138 


4 


PSMA protein 


Accession number: NP 004467 


5 


Tyrosinase cDNA 


Accession number: NM 000372 


6 


SSX-2 cDNA 


Accession number: NM 003147 


7 


PSMA cDNA 


Accession number: NM 004476 


8 


Tyr 207-215 


FLPWHRLFL 


9 


Tyr 208-216 


LPWHRLFLL 


10 


SSX-2 31-68 


YFSKEEWEKMKASEKIFY\rmKEKYEAMTKLGFK 

ATT P 


11 






12 


<55vY-9 ^0-47 




13 






14 




K'A/TK' A QT71?"TPV 


15 






16 


^<5Y-9 40-40 


IviJVrVoIlJ^Jl* 1 V 


17 




K" A ^Pl?" rpvw 

JVr\oX^J!wXr I V X 


18 


SSX-2 42-49 


A WK'TPWV 

AOCiAJr I V X 


19 


SSX-2 53-61 


RK'VFAiv/mrr 

IVIV. I CAiYL 1 JVLr 


20 


SSX-2 52-61 


K"]? YF A \ATV: T 

r^JCvTw I ij/vlYi llvLr 


21 


SSX-2 54-63 




22 


SSX-2 55-63 




23 


SSX-2 56-63 




24 


HBV 18-27 




25 


HLA-B44 binder 


A FMrJlir V^F Y 
/A-L»ivxvjrv. X ox^ X 


26 


SSX-1 41-49 


rv. X L>X>fkXO X V 


27 


SSX-3 41-49 


KVSEBaVYV 


28 


SSX-4 41-49 


KSSEKIVYV 


29 


5SX-5 4M9 


KASEKHYV 


30 


PSMA163-192 


AFSPQGMPEGDLVYVNYARTEDFFKLERDM 


31 


PSMA 168-190 


3MPEGDLVYVNYARTEDFFKLER 


32 


PSMA 169-177 


VtPEGDLVYV 


33 


PSMA 168-177 < 


3MPEGDLVYV 


34 , 


^SMA 168-176 ( 


[jMPEGDLVY 


35 I 


^SMA 167-176 ( 


3GMPEGDLVY 


36 I 


*SMA 169-176 I 


ilPEGDLVY 
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37 


PSMA 171-179 


EGDLVYVNY 


38 


PSMA 170-179 


PEGDLVYVNY 


39 


PSMA 174-183 


LVYVNYARTE 


40 


PSMA 177-185 


VNYARTEDF 


41 


PSMA 176-185 


YVNYARTEDF 


42 


PSMA 178-186 


NYARTEDFF 


43 


PSMA 179-186 


YARTEDFF 


44 


PSMA 181-189 


RTEDFFKLE 


45 


PSMA 281-310 


RGIAEAVGLPSXPVHPIGYYDAQKLLEKMG 


46 


PSMA 283-307 


lAEAVGLPSIPVHPIGYYDAOKLLE 


47 


PSMA 289-297 


LPSffVHPI 


48 


PSMA 288-297 


GLPSIPVHPI 


49 


PSMA 297-305 


IGYYDAQKL 


50 


PSMA 296-305 


PIGYYDAQKL 


51 


PSMA 291-299 


SIPVHPIGY 


52 


PSMA 290-299 


PSIPVHPIGY 


53 


PSMA 292-299 


IPVHPIGY 


54 


PSMA 299-307 


YYDAQKLLE 


55 


PSMA454-481 


SSffiG>ryTLRVDCTPIJVlYSLVHLTKEL 


56 


PSMA 456-464 


iEGNYTLRV 


57 


PSMA 455-464' 


SIEGNYTLRV 


58 


PSMA 457^64 


EGNYTLRV 


59 


PSMA 461-469 


TLRVDCTPL 


60 


PSMA 460-469 


YTLRVDCTPL 


61 


PSMA 462-470 


LRVDCTPLM 


62 


i>SMA 463-471 


RVDCTPLMY 


63 


PSMA 462^71 


LRVDCTPLMY 


64 


PSMA653-687 




65 


PSMA 660-681 


VLRMMNDQLMFLERAFIDPLGL 


66 


PSMA 663-671 


MMNDQLMFL 


67 


PSMA 662-671 


RMMNDQLMFL 


68 


PSMA 662-670 


E^MMNDQLMF 


69 


ryr 1-17 


VILLAVLYCLLWSFOTSA 


Table IB. SEQ ID NOS.* including epitopes in Examples 14 and 15. 


SEQIDNO 


[DENTITY 


SEQUENCE 


70 


SPlOO protein 


** Accession numben P40967 


71 


VIAGE-1 protein 


Accession number: P43355 


72 


VlAGE-2 protein 


Accession number: P43356 


73 J 


V[AGE-3 protein 


Accession number: P43357 


74 J 


^-ESO-1 protein 


Accession number: P78358 


75 I 


!-AGE- la protein 


Accession numben CAAl 1116 
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76 


LAGE-lb protein 


Accession number: CAAl 1 1 17 


11 


FRAME protein 


Accession number: NP 006106 


78 


PSA protein 


Accession number P07288 


19 


PSCA protein 


Accession number 043653 


80 


GPlOO cds 


Accession number: U20093 


81 


MAGE-1 cds 


Accession numben M77481 


82 


MAGE-2cds 


Accession number L18920 


83 


MAGE.3 cds 


Accession number: U03735 


84 


NY-ESO-1 cDNA 


Accession number: U87459 


85 


FRAME cDNA 


Accession number: NM 006115 


86 


PSAcDNA 


Accession number: NM 001648 


87 


PSCAcDNA 


Accession number: AF043498 


88 


GPlOO 630-638 


LPHSSSHWL 


89 


GP 100 629-638 


QLPHSSSHWL 


90 


GPlOO 614-622 


LIYRRRLMK 


91 


GPlOO 613-622 


SLIYRRRLMK 


92 


GPlOO 615-622 


EYRRRLMK 


93 


GPlOO 630-638 


LPHSSSHWL 


94 


GPlOO 629-638 


QLPHSSSHWL 


95 


MAGE-1 95-102 


ESLFRAVI 


96 


MAGE-1 93-102 


DLESLFRAVI 


97 


lVCAGE-1 93-101 


DLESLFRAV 


98 


MAGE-1 92-101 


CILESLFRAV 


99 


MAGE-1 92-100 


CILESLFRA 


100 


MAGE-1 263-271 


EFLWGPRAL 


101 


MAGE-1 264-271 


FLWGPRAL 


102 


MAGE-1 264-273 


FLWGPRALAE 


103 


MAGE-1 265-274 


LWGPRALAET 


104 


MAGE-1 268-276 . 


PRALAETSY 


105 


MAGE-1 lei-llG 


GPRALAETSY 


106 


MAGE-1 269-277 


RALAETSYV 


107 


MAGE-1 271-279 


LAETSYVKV 


108 


MAGE-1 270-279 


ALAETSYVKV 


109 


MAGE-1 272-280 


AETSYVKVL 


110 


MAGE-1 271-280 


LAETSYVKVL 


HI 


MAGE-1 274-282 


rSYVKVLEY 


1 1 0 

1 IZ 




b 1 S YVKVLEY 


113 


MAGE-1 278-286 


KLVLEYVIKV 


114 


MAGE-1 168-177 


SYVLVTCLGL 


lis 


MAGE-l 169-177 


yVLVTCLGL 


116 


VlAGE-1 170-177 


VTLVTCLGL 


117 


VlAGE-1 240-248 


rQDLVQEKY 
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1 152 


LYl/vUrxi-l Zji'-Z^o 


L 1 V^UL. V V^JtlJv Y 


1 10 


ivyfAOP 1 010 o/ift 




ion 


l\yfArJT7 1 041 0^1 
LVLA.UrJtl-1 Z'fj-Zjl 




101 


NyfAOP 1 040 0<1 
iyLA.OrjCr-1 Z*rZ-xDl 


UL, V vjilK YLEY 


1 00 


lur A/^T7 1 01 A O10 
iVLrvVjrc-i ZJU-Z?o 


aAYOErRKL 


lO^ 


ViTA/^t? 1 O70 OQ/J 


IvVLEYVIKV 




lVlACjll-1 Z//-Z60 


T^'XT^ rTT/"\ T 

VKVLEYVIKV 




\>rA/^"D 1 OT^ I0i4 

iMAOli-l Z/0-zo4 


YVKVLEYVI 


10/^ 


\4'A/^17 1 0*7 >l OOO 

iVlAvjJi-l 


l^YVKVLEY 


107 


IViAOJti- L Z / J -zoZ 


bli> Y VK.VL,bY 


10R 


N/TArrP 1 Ofll 001 


VllvVoAKVK 


10Q 


IV/TAr^P 1 OSO 001 
LVLrt.vJIS-1 ZoZ-Zyi 


Y YlJvVaAKVK 


1 JV 


M'ArjP 0 11^ 190 




1 '\ 1 
1 J 1 


NAA^iT? 0 111 100 


\/r\.FX^r \/LJI7T T T 

M V JbJL V rlr LJLL 


1 


\;f A /^TT 0 1 AO 11^ 


rOT> tr'%. JTX rr?T 

IbRKMVEL 


1 


MALrii-z lUo-Ho 


AISRKMVEL 


1 1^ 


MAljb-Z 10 /-1 10 


AAISRKMVEL 


i JD 


MALjIi-Z llZ-lzU 


KMVELVHFL 


1 1A 


A r2X7 O 1 AO 1 1 T 


ISRKMVELV 




^/^ A 0 1 AC 1 1 T 


A TOT* /TK TT'^ T T r 

AISRKMVELV 




\/r A r^-Tj T 11/; 11/1 


LVHFLLLKY 


1^0 


MAvJJi-Z 11D-1Z4 


ELVHFLLLKY 




X/T A Till 110 


RKMVELVHF 


141 


lYlALril-Z 1 JO- 100 


LQLVFGIEV 




IVlAVJIl-Z lj/-100 


Yi,QLVrGIEV 




NAArvp 0 1^0 1^0 


l^JLVrCjJLEVV 


144 




La^L. Vr Vjib V V 


145 


\/fAOP-0 1/%4 170 


LbV VJbV Vri 


146 


^/fAfJF-O 1^1 170 


LritlV VliV VJrl 


147 


M" A nn 0 1 AO 1 OA 

IVi/\.Vj£r-Z 1 OZ- i /{J 


t^ulhVVEVV 


14R 


X/tAnrU OK/1 1 


A.SEYLQLVF 


140 


Vif Anx? 0 1 CI 1 /:o 


eCASEYLQLVF 


1 <(\ 


\aALrJi-Z z 1 6-225 


EEKIWEEL 


10 i 


WAvjli-Z Zlo-2z5 


\PEEKIWEEL 


1 ^0 


VlALrxi-z z 1 o-zzi 


A T> 1 1 'WTT^ r 1^ 

\PEEKIWE 


1 ^1 


VLACxii-Z zzU-Zzo 1 


CIWEELSML 


154 I 


SlAGE-2 219-228 1 


iKIWEELSMT 


155 I 


VlAGE-2 271-278 I 


^LWGPRAL 


156 r 


vIAGE-2 271-279 I 


^LWGPRALI 


157 I 


^GE-2 278-286 I 


.DSTSYVKV 


158 P 


^GE-2 277-286 i 


U.IETSYVKV 


159 ^ 


AAGE-2 276-284 F 


tALIETSYV 
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1 Cf\ 


MAGE-2 279-287 


lETSYVKVL 


161 


MAGE-2 278-287 


LIETSYVKVL 


162 


MAGE-3 lll'llZ 


FLWGPRAL 


163 


MAGE-3 270-278 


EFLWGPRAL 


164 


MAGE-3 271-279 


FLWGPRALV 


165 


MAGE-3 276-284 


RALVETSYV 


166 


MAGE-3 272-280 


LWGPRALVE 


167 


MAGE-3 271-280 


FLWGPRALVE 


168 


MAGE-3 27 2-281 


LWGPRALVET 


169 


N[Y-ESO-l 82-90 


GPESRLLEF 


170 


NFY-ESO-l 83-91 


PESRLLEFY 


171 


NY-ESO-1 82-91 


GPESRLLEFY 


172 


NY-ESO-1 84-92 


ESRLLEFYL 


173 


NY-ESO-1 86-94 


RLLEFYLAM 


174 


nY-ESO-1 88-96 


LEFYLAMPF 


175 


NY-ESO-1 87-96 


LLEFYLAMPF 


176 


NY-ESO-1 93-102 


AMPFATPMEA 


177 


NY-ESO-1 94-102 


MPFATPMEA 


178 


NY-ESO-1 115-123 


PLPVPGVLL 


179 


NY-ESO-1 114-123 


PPLPVPGVLL 


ISO 


NY-ESO-1 116-123 


LPVPGVLL 


181 


NY-ESO-1 103-112 


ELARRSLAQD 


182 


NY-ESO-I 118-126 


VPGVLLKEF 


183 


NY-ESO-1 117-126 


PVPGVLLKEF 


184 


NY-ESO-1 116-123 


LPVPGVLL 


185 


NY-ESO-1 127-135 


TVSGNILTI 


186 


NY-ESO-1 126-135 


FTVSGNILTI 


187 


NY-ESO-1 120-128 


GVLLBCEFTV 


loo 


NY-ESO-1 121-130 


VLLKEFTVSG 


189 


NY-ESO-1 122-130 


LLKEFTVSG 


1 Art 

190 


NY-ESO-1 118-126 


S^GVLLKEF 


191 


^^Y-ESO-l 117-126 


PVPGVLLKEF 


192 


W-ESO-1 139-147 


AADHRQLQL 


193 


^TY-ESO-l 148-156 


5ISSCLQQL 


194 I 


^nf-ESO-1 147-156 1 


LSISSCLQQL 


195 


^lY-ESO-l 138-147 


rAADHRQLQL 


196 1 






197 1 


>IY-ESO-l 157-165 I 


5LLMWrrQC 


198 I 


^-ESO-1 150-158 S 


JSCLQQLSL 


199 I 


sTY-ESO-l 154-162 C 


JQLSLLMWI 


200 f 


vfY-ESO-1 151-159 S 


5CLQQLSLL 


201 t 


^-ESO-1 150-159 S 


JSCLQQLSLL 
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202 


NY-ESO-1 163-171 


TQCFLPVFL 


203 


NY-ESOl 162-171 


rrQCFLPVFL 


204 


FRAME 2 19-227 


PMQDIKMIL 


205 


FRAME 218-227 


MPMQDIKMIL 


206 


FRAME 428-436 


QHLIGLSNL 


207 


FRAME 427-436 


LQHLIGLSNL 


208 


FRAME 429^36 


HLIGLSNL 


209 


FRAME 431-439 


[GLSNLTHV 


210 


FRAME 430-439 


LIGLSNLTHV 


211 


PSA 53-61 


VLVHPQWVL 


212 


PSA 52-61 


GVLVHPQWVL 


213 


PSA 52-60 


GVLVHPQWV 


214 


PSA 59-67 


WVLTAAHCI 


215 


PSA 54-63 


LVHPQWVLTA 


216 


PSA 53-62 


VLVHPQWVLT 


217 


PSA 54-62 


LVBDPQWVLT 


218 


PSA 66-73 


CIRNKSVI 


219 


PSA 65-73 


HCIRNKSVI 


220 


PSA 56-64 


HPQWVLTAA 


221 


PSA 63-72 


AAHCIRNKSV 


222 


PSCA 116-123 


LLWGPGQL 


223 


PSCA 115-123 


LLLWGPGQL 


224 


PSCA 114-123 


GLLLWGPGQL 


225 


PSCA 99-107 


ALQPAAAIL 


226 


PSCA 98-107 


pALQPAAAIL 


227 


Tyr 128-137 


APEKDKFFAY 


228 


ryr 129-137 


PEKDKFFAY 


229 


Tyr 130-138 


EKDKFFAYL 


230 


Tyr 131-138 


KDKFFAYL 


231 


Tyr 205-213 


PAFLPWHRL 


232 


Tyr 204-213 


APAFLPWHRL 


233 


lyr 214-223 


FLLRWEQEIQ 


234 


Tyr 2 12-220 


RLFLLRAVEQ 


235 


Tyr 191-200 


GSErWRDIDF 


236 


Fyr 192-200 


SEIWRDBDF 


237 


ryr 473-481 


EUWSWLLGA 




iyr 4/o-4o4 


SWLLGAAMV 


239 


ryr 477-486 


WLLGAAMVGA 


240 


ryr 478-486 


LLGAAMVGA 


241 1 


»SMA4-12 


LLHETDSAV 


242 I 


^SMA 13-21 , 


\TARRPRWL 


243 [ 


>SMA 53-61 


rPKHNMKAF 
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244 


PSMA 64-73 


ELKAENIKKF 


245 


PSMA 69-77 


NIKKFLH^NF 


246 


PSMA 68-77 


ENIKKFLH'NF 


247 


PSMA 220-228 


AGAKGVILY 


248 


PSMA 468-477 


PLMYSLVHNL 


249 


PSMA 469-477 


LMYSLVHNL 


250 


PSMA 463-471 


RVDCTPLMY 


251 


PSMA 465-473 


DCTPLMYSL 


252 


PSMA 507-5 15 


SGMPRISKL 


253 


PSMA 506-515 


FSGMPRISKL 


254 


NY-ESO-1 136-163 


RLTAADHRQLQI^SCLQQLSLLMWrr 


255 


NY-ESO-l 150-177 


SSCLQQLSLLMWTTQCFLPVFLAQPPSG 



* This H was reported as Y in the SWISSPROT database. 



Table IC. SEQ ID NOS * Incladmg epitopes in £xamplel4. 



SEQIDNO. 


IDENTITY 


SEQUENCE 


256 


Mage-1 125-132 


KAEMLESV 


257 


Mage-1 124-132 


TKAEMLESV 


258 


Mage-1 123-132 


VTKAEMLESV 


259 


Mage-1 128-136 


MLESVIKNY 


260 


Mage-1 127-136 


EMLESVIKNY 


261 


Mage-1 125-133 


KAEMLESVI 


262 


Mage-1 146-153 


KASESLOL 


263 


Mage-1 145-153 


GKASESLQL 


264 


Mage-1 147-155 


ASESLQLVF 


265 


Mage-1 153-161 


LVFGIDVBLE 


266 


Mage-1 114-121 


LLKYRARE 


267 


Mage-1 106-113 


VADLVGFL 


268 


Magc-1 105-113 


KVADLVGFL 


269 


Magc-1 107-115 


ADLVGFLLL 


270 


Mage-1 106-115 


VADLVGFLLL 


271 


Mage-1 114-123 


LLKYRAREPV 


272 


Mage-3 278-286 


LVETSYVKV 


273 


Mage-3 277-286 


ALVETSYVKV 


274 


Mage-3 285-293 


KVLHHMVKI 


275 


Mage-3 283-291 


YVKVLHHMV 


276 


Mage-3 275-283 


PRALVETSY 


277 


Mage-3 274-283 


GPRALVETSY 


278 


Mage-3 278-287 


LVETSYVKVL 


279 


ED-B4'-5 


TIIPEVPOL 


280 


ED-B5'-5 


DTUPEVPOL 


281 


ED-B 1-10 


EVPOLTDLSF 


282 


ED-B 23-30 


TPLNSSTI 


283 


ED-B 18-25 


IGLRWTPL 


284 


ED-B 17-25 


SIGLRWTPL 


285 


ED-B 25-33 


LNSSTHGY 


286 


ED-B 24-33 


PLNSSTHGY 
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287 |ED-B 23-31 


TPLNSSXn 


288 


ED-B 31-38 


IGYRirW 


289 


ED-B 30-38 


nGYRTTW 


290 


ED-B 29-38 


TIIGYRITVV 


291 


ED-B 31-39 


IGYRTTWA 


292 


ED-B 30-39 


HGYRITWA 


293 


CEA 184-191 


SLPVSPRL 


294 


CEA 183-191 


QSLPVSPRL 


295 


CEA 186-193 


PVSPRLOL 


296 


CEA 185-193 


LPVSPRLOL 


297 


CEA 184-193 


SLPVSPRLQL 


298 


CEA 185-192 


LPVSPRLO 


299 


CEA 192-200 


QLSNGNRTL 


300 


CEA 191-200 


LQLSNGNRTL 


301 


CEA 179-187 


WVNNQSLPV 


302 


CEA 186-194 


PVSPRLOLS 


303 


CEA 362-369 


SLPVSPRL 


304 


CEA 361-369 


QSLPVSPRL 


305 


CEA 364-371 


PVSPRLOL 


306 


CEA 363-371 


LPVSPRLOL 


307 




SLPVSPRLQL 


308 


TEA 3^53-'? 70 


LPVSPRLQ 


309 


CEA 370-378 


QLSNDNRTL 


310 


CEA 369-378 


LQLSNDNRTL 


311 


CEA 357-365 


WVNNQSLPV 


312 


CFA 3fiO-'^68 


NQSLPVSPR 


313 


CFA 540-54.7 


SLPVSPRL 


314 


CEA 539-547 


QSLPVSPEIL 


315 


CEA 542-549 


PVSPRLOL 


316 


CEA 541-549 


LPVSPRLOL 


317 


CEA 540-549 


SLPVSPRLQL 


318 


CEA 541-548 


LPVSPRLO 


319 


CEA 548-556 


QLSNGNRTL 


320 


CEA 547-556 


LQLSNGNRTL 


321 


CEA 535-543 


WVNGQSLPV 


322 


CEA 533-541 


LWWVNGQSL 


323 


CEA 532-541 


YLWWVNGQSL 


324 


CEA 538-546 


GQSLPVSPR 


325 


Her-2 30-37 


DMKLRLPA 


326 


Her-2 28-37 


GTDMKLRLPA 


327 


£Xvx ^ 


HLDMLRHL 


328 


Her-2 41.dQ 


THLDMLRHL 


329 


Her-2 40-49 


ETHLDMLRHL 


330 


Her-2 36-43 


PASPETHL 


331 


Her-2 35-43 


LPASPETHL 


332 


Her-2 34-43 


RLPASPETEIL 


333 


Her-2 38-46 


SPETHLDML 


334 


Her-2 37-46 


ASPETHLDML 


335 


Her-2 42-50 


HLDMLRHLY 


336 


Her-2 41-50 


THLDMLRHLY 


337 


Her-2 719-726 


ELRKVKVL 
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Her-2 718-726 


TELRKVKVL 


•3 J7 


Her-2 717-726 


ETELRKVKVL 




Her-2 715-723 


LKETELRKV 




Hpr-2 1^A~1T\ 


rLKETELRKV 




Her-2 712-720 

XlwL Z> / 1 A*— / JUKI 


MRILKETEL 




TTpr-2 71 1 -770 


OMRILKETEL 

V^IVXXXXX^I/WJLJ X ' '* * 


JnH 


Hpr-9 717-72S 


ETELRKVKV 




Hpr-9 716-72S 


KETELRKVKV 




Rm--9 70*1-714 


MPNOAOMRI 




Hw-9 7nS-714 


AMPNOAOMRI 


j^O 


Wpt 9 70/^-7 1 S 


MPNOAOMRBL 






RPRFRELV 

XVX XvX X> 1 / 1 < V 






CRPRFRELV 

VvXXX XvX X\ 1/1.; V 






PFRFT VSPF 

XVT X\ i « 1 t V OX^ 




WPP -9 Q/^7-<)7fi 


PRFRELVSEF 

X XVf lXi.*l,* ▼ ox>x 




WPP -9 Q/i4-079 


FrRPRPRFT 






GAASGLNGC 






RASGPGGGA 




MY-P^O-l 64-79 


PHGOAASGL 


J J / 


xjv PQO-1 6*^-79 

IN I -Hoi-/- 1 OO- / X 


VJx xxvJVJj^-rvo\JX^ 


JDo 


xrv P<2n-i 60-AQ 


APPrrPHGTrA A 
rvx r\.vJi XX vjvjxvfv 






VPPPPWK'T 

V JSJr JSJV YV AJLi 




pi> AXyrp 111 1 1Q 


PVPPPPWK'T 


JQi 


P"R AMT? 1 1^^-191 

iJV/VlVJLC 


PPPP Wl^T OV 




PT? AA/TP 1 14-199 


PPPWTTT OVT 




PP ATV/TP 1 1'^-199 


PPPPWliTT OVT 

iXX IVXV T V lVX_»V^ V Lj 


JU*T 


PP AK/TP 1 16-194 


PWK"! OVT r>T 

1\. VY JSAJy^ Y Xjk^A-j 




PPAIUP 1 1S-194 

XXW^JYl-C* 1 1*/— l^t 


RRWKT OVLDL 




PPAMP 174-1X9 


PVEVLVDLF 

X^ V X> V Xj Y X^X^X 




PPA^/TP 10Q-9n6 


V JVX\.^.J%^N V X-/ 




PPAMP 108-906 


PTWPK'K'NVT 




PPATUP 107-906 


PIirVTrRKK"NVT 

£r£V. V Jkf\f\ff^>i V 1j 


370 


PRAMP 198-205 


KVKRKKNV 

XV V IVXxlV IN 1^ V 




PPA\TP 201-90R 


PKKNVT RL 




PPAMP 200-90R 


k:pk:k:mvt rt 


373 


X jvrvivxXii X77— 




374 


PR AMP 1R9-1Q6 

X XV/il1VXX> lO^ 


DPT P<?YT T 


375 


PPAMP 90S-9t'^ 

X XVcVLVXX^ £m\JJ^At i J 


VT PT rPKKT 


376 


PP AMP 904-9 1 


NVT P T PrTCTCT 


'Ml 


PPAMP 104-9^9 


VT TPK'VT^RTr 


J 1 a 


PP AMP 74-Rl 


riAWPPTPT 


•J / 


PPAMP I'X R1 


Vn A WPTTTr'T 




PPAMP 79-ftl 


M\7r\ A WPT7TPT 


381 


FRAME 81-88 


LPLGVLMK 


382 


FRAME 80-88 


CLFLGVLMK 


383 


FRAME 79-88 


TCLPLGVLMK 


384 


FRAME 84-92 


GVLMKGQHL 


385 


FRAME 81-89 


LPLGVLMKG 


386 


FRAME 80-89 


CLPLGVLMKG 


387 


FRAME 76-85 


WPFTCLPLGV 


388 


FRAME 51-59 


ELFPPLFMA 
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389 


FRAME 49-57 


PRELFPPUF 


390 


FRAME 48-57 


LPRELFPPLF 


391 


FRAME 50-58 


RELFFPLFM 


392 


FRAME 49-58 


PRELFPPLFM 


393 


FSA 239-246 


RPSLYTKV 


394 


PSA 238-246 


ERPSLYTKV 


395 


PSA 236-243 


LPERPSLY 


396 


PSA 235-243 


ALFERPSLY 


397 


PSA 241-249 


SLYTKWHY 


398 


PSA 240-249 


PSLYTKWHY 


399 


PSA 239-247 


RFSLYTKW 


400 


PSMA 211-218 


GNKVKNAQ 


401 


PSMA 202-209 


lARYGKVF 


402 


PSMA 217-225 


AQLAGAKGV 


403 


PSMA 207-215 


KVFRGNKVK 


404 


PSMA 21 1-219 


GNKVKNAQL 


405 


PSMA 269-277 


TPGYPANEY 


406 


PSMA 268-277 


LTPGYPANEY 


407 


PSMA 271-279 


GYPANEYAY 


408 


PSMA 270-279 


PGYPANEYAY 


409 


PSMA 266-274 


DPLTPGYPA 


410 


PSMA 492-500 


SLYESWTBCK 


411 


PSMA 491-500 


KSLYESAVTKK 


412 


PSMA 486-494 


EGFEGKSLY 


413 


PSMA 485-494 


DEGFEGKSLY 


414 


PSMA 498-506 


TKKSPSPEF 


415 


PSMA 497-506 


WTKKSPSPEF 


416 


PSMA 492-501 


SLYESWTKKS 


417 


PSMA 725-732 


WGEVBCRQI 


418 


PSMA 724-732 


AWGEVKRQI 


419 


PSMA 723-732 


KAWGEVKRQI 


420 


PSMA 723-730 


KAWGEVKR 


421 


PSMA 722-730 


SKAWGEVKR 


422 


PSMA 73 1-739 


QIYVAAFTV 


423 


PSMA 733-741 


YVAAFTVOA 


424 


PSMA 725-733 


WGEVKRQIY 


425 


PSMA 727-735 


EVBCRQIYVA 


426 


PSMA 738-746 


TVOAAAETL 


427 


PSMA 737-746 


FTVQAAAETL 


428 


PSMA 729-737 


KRQIYVAAF 


429 


PSMA 721-729 


PSKAWGEVK 


430 


PSMA 723-731 


KAWGEVKRO 


431 


PSMA 100-108 


WKEFGLDSV 


432 


PSMA 99-108 


QWKEFGLDSV 


433 


PSMA 102-111 


EFGLDSVELA 


434 


SCP-1 126-134 


ELROKESKL 


435 


SCP-1 125-134 


AELRQKESKL 


436 


SCP-l 133-141 


KLQENRKH 


437 


SCP-l 298-305 


QLEEKTKL 


438 


SCP-1 297-305 


NQLEEKTKL 


439 


SCP-1 288-296 


LLEESRDKV 
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440 


SCP-1 287-296 


FLLEESRDKV 


441 


SCP-1 291-299 


ESRDKVNQL 


442 


SCP-1 290-299 


EESRDKVNQL 


443 


SCP-1 475-483 


EBCEVHDLEY 


444 


SCP-1 474-483 


REKEVHDLEY 

X^ 1 ^X \>JU< T XAJ VI «l ) X 


445 


SCP-1 480-488 


DLEYSYCHY 


446 


SCP-1 477-485 


EVHDLEYSY 


447 


SCP-1 477-486 


EVHDLEYSYC 


448 


SCP-1 502-509 


KLSSKREL 


449 


SCP-1 508-515 


ELKNTEYF 

* ^ ' ^ ■ ^ ' ^ X X^ X X 


450 


SCP-1 507-515 


RELKNTEYF 


451 


SCP-1 496-503 


KRGQRPKL 


452 


SCP-1 494-503 


LPKRGQRPKL 


453 


SCP-1 509-517 


IJKKrEYFTL 


454 


SCP-1 508-517 


ELKNTEYFTL 


455 


SCP-1 506-514 


KRELKNTEY 


456 


SCP-1 502-510 


KLSSKRELK 


457 


SCP-1 498-506 


GORPKLSSK 


458 


SCP-1 497-506 


RGORPKLSSK 


459 


SCP-1 500-508 


RPKLSSKRE 


460 


SCP-1 573-580 


LEYVREEL 

X_/X^ X V XN 1 <1 /X^ 


461 


SCP-1 572-580 


ELEYVREEL 


462 


SCP-1 571-580 


NELEYVREEL 


463 


SCP-1 579-5S7 


ELKOKREDEV 

J^X^XX.\^X^JL\X^J_^X^ V 


464 


OV_/X 1 J 1 J ■JO J 


YVRFFI KOK 


465 


SCP-1 632-640 


OLNVYEIKV 


466 


SCP-1 630-638 


SKOLNVYEI 


467 


SCP-1 628-636 


AESKOLNVY 


468 


SCP-1 627-636 


TAESKOLNVY 


469 


SCP-1 638-645 


IKVNKLEL 


470 


SCP-1 637-645 


EIKVNKLEL 


471 


SCP-1 636-645 


YEIKVNKLEL 


472 


SCP-1 642-650 


KLELBLESA 


473 


SCP-1 635-643 


VYEIKVNKL 


474 


SCP-1 634-643 


NVYEIKVNKL 


475 


SCP-1 646-654 


ELESAKQKF 


476 


SCP-1 642-650 


KLELELESA 

II rl if f ^ 1^ mm *i * 1 X 


477 


SCP-1 646-654 


ELESAKOKF 

1 «l «l <>jJrkxv\^xVA 


478 


SCP-1 771-778 


KEBCLKREA 


479 


SCP-1 777-785 


EAKENTATL 


480 


SCP-1 776-785 


REAKENTATL 

XXX^x^XVX^l^ X X x^ 


481 


SCP-1 n^'izi 


KLKREAICENT 


482 


SCP-1 112-119 


EAEKIKKW 

x>Ax-L» JV i Pfc rv vv 


483 


SCP-1 101-109 


GLSRVYSKL 


484 


SCP-1 100-109 


EGLSRVYSKL 


485 


SCP-1 108-116 


KLYKEAEKI 


486 


SCP-1 98-106 


NSEGLSRVY 


487 


SCP-1 97-106 


ENSEGLSRVY 


488 


SCP-1 102-110 


LSRVYSKLY 


489 


SCP-1 101-110 


GLSRVYSKLY 


490 


SCP-1 96-105 


LENSEGLSRV 
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491 


SCP-l 108-117 


KLYKEAEKK 


492 


SCP-1 949-956 


REDRWAVI 


493 


SCP-1 948-956 


MREDRWAVI 


494 


SCP-1 947-956 


KMREDRWAVI 


495 


SCP-1 947-955 


KMREDRWAV 


496 


SCP-1 934-942 


TTPGSTLKF 


497 


SCP-1 933-942 


LTTPGSTLKF 


498 


SCP-1 937-945 


GSTLKGAI 


499 


SCP-1 945-953 


IRKMREDRW 


500 


SCP-1 236-243 


RLEMHFKL 


501 


SCP-1 235-243 


SRLEMHFKL 


502 


SCP-1 242-250 


KLKEDYEKI 


503 


SCP-1 249-257 


KIQHLEQEY 


504 


SCP-1 248-257 


EKIQHLEQEY 


505 


SCP-1 233-242 


ENSRLEMHF 


506 


SCP-1 236-245 


RLEMHFKLKE 


507 


SCP-1 324-331 


LEDDCVSL 


508 


SCP-1 323-331 


ELEDIKVSL 


509 


SCP-l 322-331 


KELEDIKVSL 


510 


SCP-1 320-327 


LTKELEDI 


511 


SCP-1 319-327 


HLTKELEDI 


512 


SCP-1 330-338 


SLQRSVSTQ 


513 


SCP-1 321-329 


TKELEDIKV 


514 


SCP-l 320-329 


LTKELEDIKV 


515 


SCP-1 326-335 


DIKVSLQRSV 


516 


SCP-1 281-288 


KMKDLTFL 


517 


SCP-1 280-288 


NKMKDLTFL 


518 


SCP-1 279-288 


ENKMKDLTFL 


519 


SCP-1 288-296 


LLEESRDKV 


520 


SCP-1 287-296 


FLLEESRDKV 


521 


SCP-1 291-299 


ESRDKVNQL 


522 


SCP-1 290-299 


EESRDKVNQL 


523 


SCP-1 277-285 


EKENKMKDL 


524 


SCP-1 276-285 


TEKENKMKDL 


525 


SCP-1 279-287 


ENKMKDLTF 


526 


SCP-1 218-225 


lEKMITAF 


527 


SCP-1 217-225 


NIEKMITAF 


528 


SCP-1 216-225 


SNIEKMITAF 


529 


SCP-1 223-230 


TAFEELRV 


530 


SCP-1 222-230 


ITAFEELRV 


531 


SCP-1 221-230 


MTTAFEELRV 


532 


SCP-1 220-228 


KMTTAFEEL 


533 


SCP-1 219-228 


EKMITAFEEL 


534 


SCP-1 227-235 


ELRVQAENS 


535 


SCP-1 213-222 


DLNSNIEKMI 


536 


SCP-l 837-844 


WTSAKNTL 


537 


SCP-1 846-854 


TPLPKAYTV 


538 


SCP-1 845-854 


STPLPKAYTV 


539 


SCP-1 844-852 


LSTPLPKAY 


540 


SCP-1 843-852 


TLSTPLPKAY 


541 


SCP-1 842-850 


NTLSTPLPK 
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542 


SCP-1 841-850 


KNTLSTPLPK 


543 


SCP-1 828-835 


ISKDKRDY 


544 


SCP-1 826-835 


HGISKDKRDY 


545 


SCP-1 832-840 


KRDYLWTSA 


546 


SCP-1 829-838 


SKDKRDYLWT 


547 


SCP-1 279-286 


ENKMECDLT 


548 


SCP-1 260-268 


EINDKEKOV 


549 


SCP-1 274-282 


QITEKENKM 


550 


SCP-1 269-277 


SLLLIOrrE 


551 


SCP-1 453-460 


FEKIAEEL 


552 


SCP-1 452-460 


QFEKIAEEL 


553 


SCP-1 451-460 


KOFEKIAEEL 


554 


SCP-1 449-456 


DNKQFEKI 


555 


SCP-1 448-456 


YDNKQFEKI 


556 


SCP-1 447-456 


LYDNKOFEKT 


557 


SCP-1 440-447 


LGEKETLL 


558 


SCP-1 439-447 


VLGEKETLL 


559 


SCP-1 438-447 


KVLGEKETLL 


560 


SCP-1 390-398 


LLRTEQQRL 


561 


SCP-1 389-398 


ELLRTEOQRL 


562 


SCP-1 393-401 


TEQQRLENY 


563 


SCP-1 392-401 


RTEQQRLENY 


564 


SCP-1 402-410 


EDQLIILTM 


565 


SCP-1 397-406 


RLENYHDQLI 


566 


SCP-1 368-375 


KARAAHSF 


567 


SCP-1 376-384 


WTEFETTV 


568 


SCP-1 375-384 


FWTEFETTV 


569 


SCP-1 377-385 


VTEFETTVC 


570 


SCP-1 376-385 


WTEFETTVC 


571 


SCP-1 344-352 


DLOIATNTI 


572 


SCP-1 347-355 


lATNTICQL 


573 


SCP-1 346-355 


QIATNTICQL 


574 


SSX4 57-65 


VMTKLGFKY 


575 


SSX4 53-61 


LNYEVMTKL 


576 


SSX4 52-61 


KLNYEVMTKL 


577 


SSX4 66-74 


TLPPFMRSK 


578 


SSX4 110-118 


KIMPKKPAE 


579 


SSX4 103-112 


SLORIFPKIM 


580 


Tyr 463-471 


YKSYLEQA 


581 


Tyr 459^67 


SFQDYIKSY 


582 


Tyr 458-467 


DSFQDYIKSY 


583 


Tyr 507-514 


LPEEKOPL 


584 


Tyr 506-514 


OLPEEKOPL 


585 


Tyr 505-514 


KQLPEEKQPL 


586 


Tyr 507-515 


LPEEKQPLL 


587 


Tyr 506-515 


QLPEEKQPLL 


588 


Tyr 497-505 


SLLCRHKRK 


589 


ED-B domain of 


EVPQLTDLSFVDITDSSrGLRWTPLNSSTnGYRI 




Fibronectin 


TWAAGEGIPIFEDFVDSSVGYYTVTGLEPGID 






YDISVTTLINGGESAPTTLTQQT 


590 


ED-B domain of 


CTFDI^SPGLEYNVSVYTVKDDKESVPISDTIIP 
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x*iurQiiccun wiui 
fiaxUdng sequence 
■from Fribronectin. 


EVPQLTDLSFVDITDSSIGLRWTPLNSSTIIGYRI 
TWAAGEGPIFEDFVDSSVGYYTVTGLEPGID 
YDISVrriJNGGESAPTTLTQQT 
AVPPPrnLRFTNIGPDTMRVTW 


SOI 


ED-B domain of 
Fibronectin cds 


Accession number. X07717 


592 


CEA protein 


Accession number: P0673 1 


593 


CEAcDNA 


Accession number: NM 004363 


594 


Her2/Neu protein 


Accession number: P04626 


595 


Her2/Neu cDNA 


Accession number: Ml 1730 


596 


SCP-1 protein 


Accession number: Q15431 


597 


SCP-1 cDNA 


Accession number: X95654 


598 


SSX-4 protein 


Accession number: 060224 


599 


SSX-4cDNA 


Accession number: NM 005636 



be useful as epitopes in any of the various embodiments of the invention. Any of SEQ ID NOS. 10, 
30, 31, 45, 46, 55, 64, 65, 69, 254, and 255 can be useful as sequences containing epitopes or 
epitope clusters, as described in various embodiments of the mvention. 

5 

**A11 accession numbers used here and throughout can be accessed through the NCBI databases, 
for example, through the Bntrez seek and retrieval system on the world wide web. 

Note that the following discussion sets forth the inventors' understanding of the operation 

10 of the invention. However, it is not intended that this discussion limit the patent to any particular 

theory of operation not set forth in the claims. 

In pursuing the development of epitope vaccines others have generated lists of predicted 
epitopes based on MHC binding motifs. Such peptides can be immunogenic, but may not 
correspond to any naturally produced antigenic fragment so ttiat whole antigen will not elicit a 

15 similar response or sensitize a target cell to cytolysis by CTL, Therefore such lists do not 
differentiate between those sequences that can be usefid as vaccines and those that cannot Efforts 
to determine which of these predicted epitopes are in fact naturally produced have often relied on 
screening tiieir reactivity with tumor infiltrating lyn^hocytes (TIL). However, TIL are strongly 
biased to recognize immune epitopes whereas tumors (and chronically infected cells) will gaierally 

20 present housekeeping epitopes. Thus, unless the epitope is produced by both the housekeeping and 

immuno- proteasomes, the target cell will generally not be recognized by CTL induced with TIL- 
identified epitopes. The epitopes of the present invention, in contrast, are generated by the action a 
specified proteasome. indicating that they can be naturally produced, and enabling then" appropriate 
use. The inq)ortance of the distinction between housekeeping and immune epitopes to vaccine 

25 design in more fully set forth in PCT publication WO 01/82963A2. 

The epitopes of the invention include or encode polypeptide fragments of TAAs that are 
precursors or products of proteasomal cleavage by a housekeeping or immune proteasome, and that 
have known or predicted affinity for at least one allele of MHC I. In some embodiments, the 
epitopes include or encode a polypeptide of about 6 to 25 amino acids in length, preferably about 7 
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to 20 amino acids in length, more preferably about 8 to 15 amino acids in leng&, and stiU more 
preferably 9 or 10 amino acids in lengfli. However, it is understood that the polypeptides can be 
larger as long as they do not contain sequences that cause the polypeptides to be directed away 
from the proteasome or to be destroyed by the proteasome. For immune epitopes, if the larger 
peptides do not contain such sequences, they can be processed in the pAPC by the immune 
proteasome. Housekeeping epitopes may also be embedded in longer sequences provided that the 
sequence is adapted to feUcitate liberation of the epitope^s Otermmus by action of the 
immunoprotcasome. The sequences of these epitopes can be subjected to computer analysis in 
order to calculate physical, biochemical, immunologic, or molecular genetic properties such as 
mass, isoelectric point, predicted mobility in electrophoresis, predicted binding to other MHC 
molecules, melting temperature of nucleic acid probes, reverse translations, similarity or homology 
to other sequences, and the like. 

constructing the polynucleotides encoding the polypeptide epitopes of the invention, the 
gene sequence of the associated TAA can be used, or the polynucleotide can be assembled from 
any of the corresponding codons. For a 10 amino acid epitope this can constitute on the order of 
10* different sequences, dependmg on the particular amino acid composition. While large, this is a 
distinct and readily definable set representing a miniscule fraction of the >10'* possMe 
polynucleotides of this length, and thus in some embodiments, equivalents of a particular sequence 
disclosed herein encon^ass such distinct and readily definable variations on the listed sequence. 
In choosing a particular one of these sequences to use in a vaccine, considerations such as codon 
usage, self-complementarity, restriction sites, chemical stability, etc. can be used as will be 
apparent to one skilled in the art 

The invention contemplates producing peptide epitopes. Specifically these epitopes are 
derived from the sequence of a TAA, and have known or predicted affinity for at least one allele of 
MHC I. Such epitopes are ^ically identical to those produced on target cells or pAPCs. 
Compositions Containing Active Epitopes 

Embodiments of the present invention provide polypeptide compositions, including 
vaccines, therapeutics, diagnostics, pharmacological and pharmaceutical compositions. The 
various compositions include newly identified epitopes of TAAs, as well as variants of these 
epitopes. Other embodiments of the invention provide polynucleotides encoding the polypeptide 
qjitopes of the invention. The invention further provides vectors for expression of die polypeptide 
epitopes for purification, hi addition, the invention provides vectors for the expression of the 
polypeptide epitopes in an APC for use as an anti-tumor vaccine. Any of the epitopes or antigens, 
or nucleic acids encoding the same, from Table 1 A can be used. Other embodiments relate to 
methods of making and using the various compositions. 
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A general architecture for a class I NfHC-bmding epitope can be described, and has been 
reviewed more extensively in Madden, D.R. Anna. Rev. Immunol 13:587-622, 1995. Much of the 
binding energy arises from main chain contacts between conserved residues in the MHC molecule 
and the and C-tennini of the peptide. Additional main chain contacts are made but vary among 
MHC alleles. Sequence specificity is confeired by side chain contacts of so-called anchor residues 
with pockets that, again, vary among MHC alleles. Anchor residues can be divided into primary 
and secondary. Primary anchor positions exhibit strong preferences for relatively well-defined sets 
of amino acid residues. Secondary positions show weaker and/or less well-defined preferences that 
can often be better described in terms of less favored, rather than more favored, residues. 
Additionally, residues in some secondary anchor positions are not always positioned to contact die 
pocket on the MHC molecule at all. Thus, a subset of peptides exists that bind to a partici^lar MHC 
molecule and have a side chain-pocket contact at die position in question and another subset exists 
that show binding to the same MHC molecule diat does not depend on the conformation the 
peptide assumes in the pq>tide-binding groove of tiie MHC molecule. The C-terminal residue (P ) 
is preferably a primary anchor residue. For many of the better studied HLA molecules (e.g. A2, 
A68, B27, B7, B35, and B53) the second position (P2) is also an anchor residue. However, central 
anchor residues have also been observed including P3 and P5 in HLA-B8, as well as P5 and P -3 
in the murine MHC molecules H-2D** and H-2K^, respectively. Since more stable binding v/ill 
generally improve immunogenicity, anchor residues are preferably conserved or optimized in the 
design of variants, regardless of their position. 

Because the anchor residues are generally located near the ends of the epitope, the peptide 
can buckle upward out of the peptide-binding groove allowing some variation in length. Epitopes 
ranging firom 8-11 amino acids have been found for HLA-A68, and up to 13 amino acids for HLA- 
A2. In addition to length variation between the anchor positions, single residue truncations and 
extensions have been reported and the N- and C-tcrmini, respectively. Of the non-anchor residues, 
some point up out of the groove, making no contact with the MHC molecule but being available to 
contact the TCR, very often PI, P4, and P -1 for HLA-A2. Others of the non-anchor residues can 
become interposed between the upper edges of the p^tide-bindmg groove and the TCR, contacting 
both. The exact positioning of these side chain residues, and thus their effects on binding, MHC 
fine conformation, and ultimately immunogoiicity, are highly sequence dependent. For an epitope 
to be highly immunogenic it must not only promote stable enougji TCR binding for activation to 
occur, but the TCR must also have a high enough off-rate that multiple TCR molecules can interact 
sequentially with the same peptide-MHC con^lex (Kalergis, A.M. et al.. Nature Immunol. 2:229- 
234, 2001). Thus without further information about the ternary complex, both conservative and 
non-conservative substitutions at these positions merit consideration when designing variants. 
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The polypeptide epitope variants can be made, for example, using any of tiie techniques 
and guidelines for conservative and non-conservative mutations. Variants can be derived from 
substitution, deletion or insertion of one or more amino acids as conopared with the native 
sequence. Amino acid substitutions can be the result of replacing one amino acid with another 
5 amino acid having similar structural and/or chemical properties, such as the replacement of a 
threonine with a serine. Such replacements are referred to as conservative amino acid 
replacements, and all appropriate conservative amino acid replacements are considered to be 
embodiments of one invention. Lisertions or deletions can optionally be in the range of about 1 to 
4, preferably 1 to 2, amino acids. It is generally preferable to maintain the "anchor positions" of 

10 the peptide which are responsible for binding to the MHC molecule in question, indeed, 
* immimogenicity of peptides can be improved in many cases by substituting more preferred residues 
at tiie anchor positions (Franco, et al.. Nature Immunology, 1(2): 145-150, 2000). Immunogenicity 
of a peptide can also often be improved by substituting bulkier amino acids for small amino acids 
found in non-anchor positions while maintaining sufficient cross-reactivity with the original 

15 epitope to constitute a useful vaccine. The variation allowed can be determined by routine 

insertions, deletions or substitutions of amino acids in the sequence and testing the resulting 
variants for activity exhibited by the polypeptide epitope. Because the polypeptide epitope is often 
9 amino acids, the substitutions preferably are made to the shortest active epitope, for example, an 
epitope of 9 amino acids. 

20 Variants can also be made by adding any sequence onto the N-terminus of the polypeptide 

epitope variant. Such N-terminal additions can be from 1 amino acid up to at least 25 amino acids. 
Because peptide epitopes are often trimmed by N-terminal exopeptidases active in the pAPC, it is 
understood that variations in the added sequence can have no effect on tiie activity of the epitope. 
In preferred embodiments, the amino acid residues between the last upstream proteasomal cleavage 

25 site and the N-terminus of the MHC epitope do not include a proline residue. Serwold, T, at al., 

Nature Immunol. 2:644-651, 2001. Accordingly, effective epitopes can be generated from 
precursors larger than the preferred 9-mer class I motif. 

Peptides arc useful to the extent that they correspond to epitopes actually displayed by 
MHC I on the surfece of a target cell or a pACP. A single peptide can have varying affinities for 

30 different MHC molecules, binding some well, others adequately, and still others not appreciably 
(Table 2). MHC alleles have traditionally been grouped according to serologic reactivity which 
does not reflect the structure of the peptide-binding groove, which can differ among different 
alleles of the same type. Similarly, binding properties can be shared across types; groups based on 
shared binding properties have been termed supertypes. There are numerous alleles of MHC I in 

35 the human population; epitopes specific to certain alleles can be selected based on the genome of 

the patient. 
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Table 2. 

Predicted Binding of Tvrosinaseiff,.^ii; fSEO ID NO. 1^ to Various MHC types 



MHCItype 


*Halftime of 


dissociation (rain) 


Al 


0.05 


A*0201 


1311. 


A*0205 


50.4 


A3 


2.7 


A*l 101 (part of the A3 supertype) 


0.012 


A24 


6.0 


B7 


4.0 


B8 


8.0 


B14 (part of the B27 supertype) 


60.0 


B*2702 


0.9 


B*2705 


30.0 


B*3501 (part of the B7 supertype) 


2.0 


B*4403 


0.1 


B*5101 (part of the B7 supertype) 


26.0 


B*5102 


55.0 


B*5801 


0.20 


B60 


0.40 


B62 


2.0 



*HLA Peptide Binding Predictions (internet http:// access at binias.dcrtJiih.gov/molbio/hIa_bin) 

5 

fa further embodiments of the invention, the epitope, as peptide or encoding 
polynucleotide, can be administered as a vaccine or immimogenic composition, alone or in 
combination with various adjuvants, carriers, or excipients. It should be noted that although the 
term vaccine may be used herein, the discussion can be applied and used with any of the other 

10 compositions mentioned herein. Particularly advantageous adjuvants include various cytokines and 

• oligonucleotides containing immunostimulatory sequences (as set forth in greater detail in the co- 
pending applications referenced herein). Additionally the polynucleotide encoded epitope can be 
contained in a virus (e.g. vaccinia or adenovirus^ or in a microbial host cell (e.g. Salmonella or 
Listeria monocytogenes) which is then used as a vector for the polynucleotide (Dietrich, G. et al. 

15 Nat. Biotech. 16:181-185, 1998). Alternatively a pAPC can be transformed, ex vfvo, to express the 

epitope, or pulsed with peptide epitope, to be itself administered as a vaccine. To increase 
efficiency of these processes, the encoded epitope can be carried by a viral or bacterial vector, or 
complexed with a ligand of a receptor found on pAPC. Similarly the peptide epitope can be 
complexed with or conjugated to a pAPC ligand A vaccine can be composed of more than a single 

20 epitope. 

Particularly advantageous strategies for incorporating epitopes, and combining them with 
epitope clusters, into a vaccine are disclosed in U.S. Patent Application No. 09/560,465 entitled 
"EPITOPE SYNCHRONIZATION IN ANHGEN PRESENTING CELLS," filed on April 28, 
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2000. J^itope clusters for use in connection with fliis invention are disclosed in U.S. Patent 
Application No. 09/561,571 entitled '"EPITOPE CLUSTERS," filed on April 28, 2000. 

Preferred embodiments of the present invention are directed to vaccines and methods for 
causing a pAPC or population of pAPCs to present housekeeping epitopes that correspond to the 
epitopes displayed on a particular target cell. Any of the epitopes or antigens in Table 1 A, can be 
used for example. In one embodiment, the housekeeping epitope is a TuAA epitope processed by 
the housekeeping proteasome of a particular tumor type. In another embodiment, the housekeeping 
epitope is a virus-associated epitope processed by the housekeeping proteasome of a cell infected 
with a virus. This facilitates a specific T cell response to the target cells. Concurrent expression 
by the pAPCs of multiple epitopes, corresponding to different induction states (pre- and post- 
attack), can drive a CTL response effective against target cells as they display either housekeeping 
epitopes or inmiune epitopes. 

By having both housekeeping and immune epitopes present on the pAPC, this embodiment 
can optimize the cytotoxic T cell response to a target cell. Witii dual epitope expression, tibe 
pAPCs can continue to sustain a CTL response to the immime-type epitope when the tumor cell 
switches from the housekeeping proteasome to the immune proteasome with induction by IFN, 
which, for example, may be produced by tumor-infiltrating CTLs. 

In a preferred embodiment, immunization of a patient is with a vaccine that includes a 
housekeeping epitope. Many preferred TAAs arc associated exclusively with a target cell, 
particularly in the case of infected cells. In another embodiment, many preferred TAAs are the 
result of deregulated gene expression in transformed cells, but are found also in tissues of the testis, 
ovaries and fetus. In another embodiment, useful TAAs are expressed at higher levels in the target 
cell than in other cells. In still other embodiments, TAAs are not differentially expressed in the 
target cell compare to other cells, but are still usefiil since they are involved in a particular function 
of the cell and differentiate the target cell from most other peripheral cells; in such embodiments, 
healthy cells also displaying the TAA may be collaterally attacked by the induced T cell response, 
but such collateral damage is considered to be far preferable to the condition caused by the target 
cell. 

A prefenred embodiment of the present invention includes a method of administering a 
vaccine including a housekeeping epitope to induce a therapeutic immune response. The vaccine is 
administered to a patient in a manner consistent with the standard vaccine delivery protocols that 
are well known in the art Methods of administering epitopes of TAAs include, without limitation, 
transdermal, intranodal, perinodal, oral, intravenous, intradermal, intramuscular, intraperitoneal, 
and mucosal administration. A particularly useful method of vaccine delivery to elicit a CTL 
response is disclosed in PCT Publication No. WO 99/01283, entitled "A METHOD OF 
INDUCING A CTL RESPONSE," filed on July 10, 1998. 
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Because tiie epitope synchronization system has utihty in inducing a cell mediated immune 
response, a vaccine to induce a specific Tcell response to a target cell is likewise included in a 
preferred embodiment of the present invention. The vaccine contains a housekeeping epitope in a 
concentration effective to cause a pAPC or populations of pAPCs to display housekeeping 
epitopes. Advantageously, the vaccine can include a plurality of housdceeping epitopes or one or 
more housekeeping epitopes optionally in combination with one or more immune epitopes. 
Formulations of the vaccine contain peptides and/or nucleic acids in a concentration sufficient to 
cause pAPCs to present the epitopes- The formulations preferably contain epitopes in a total 
concentration of about l^g-lmg/ 100^1 of vaccine preparation. Conventional dosages and dosing 
for peptide vaccines and/or nucleic acid vaccines can be used with the present invention, and such 
dosing regimens arc well understood in the art. In one embodiment, a single dosage for an adult 
human may advantageously be from about I to about 5000 of such a composition, administered 
one time or multiple times, e.g., in 2, 3, 4 or more dosages separated by 1 week, 2 weeks, I month, 
or more, insulin pump delivers 1 ul per hour (lowest frequency) ref intranodal method patent. 

The compositions and methods of the invention disclosed herein further contenq)late 
incorporating adjuvants into the formulations in order to enhance the performance of the vaccines. 
Specifically, the addition of adjuvants to the formulations is designed to enhance the delivery or 
uptake of the epitopes by tiae pAPCs. The adjuvants contemplated by the present invention are 
known by those of skill in the art and include, for example, GMCSF, GCSF, IL-2, lL-12, BCG, 
tetanus toxoid, osteopontin, and ETA-1 . 

In some embodiments of the invention, the vaccines can include a recombinant organism, 
such as a virus, bacterium or parasite, genetically engineered to express an epitope in a host. For 
example, Listeria monocytogenes, a gram-positive, facultative intracellular bacterium, is a potent 
vector for targeting TuAAs to.fhe immune system. In a prefmed embodiment, this vector can be 
engineered to express a housekeeping epitope to induce therapeutic responses. The normal route of 
infection of this organism is through the gut and can be delivered orally. In another embodiment, 
an adenovirus (Ad) vector encoding a housekeeping epitope for a TuAA can be used to induce anti- 
virus or anti-tumor responses. Bone marrow-derived dendritic cells can be transduced with the 
virus construct and then injected, or the virus can be delivered directly via subcutaneous injection 
into an animal to induce potent T-cell responses. Another embodiment employs a recombinant 
vaccinia virus engineered to encode amino acid sequences corresponding to a housekeeping epitope 
for a TAA Vaccinia viruses carrying constructs with the aj^ropriate nucleotide substitutions in 
the form of a minigene construct can direct the expression of a housekeeping epitope, leading to a 
therapeutic T cell response against the epitope. 

The immunization with DNA requires that APCs take up the DNA and express the encoded 
proteins or peptides. It is possible to encode a discrete class I peptide on the DNA, By 
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immunizing with this constmct, APCs can be caused to express a housekeeping epitope, which is 
then displayed on class I MHC on the surface of the cell for stimulating an appropriate CTL 
response. Constructs generally relying on termination of translation or non-proteasomal proteases 
for generation of proper termini of housekeeping q>itDpes have been described m U.S. Patent 

5 application No. 09/561,572 entitied EXPRESSION VECTORS ENCODING EPITOPES OF 
TARGET-ASSOCIATED ANTIGENS, filed on April 28, 2000. 

It can be desirable to express housekeeping peptides in the context of a larger protein. 
Processing can be detected even when a small number of amino acids are present beyond the 
terminus of an qjitope. Small peptide hormones are usually proteolytically processed from longer 

10 translation products, often in the size range of approximately 60-120 amino acids. This feet has led 
some to assume that this is the minimum size that can be efficientiy translated. In some 
embodiments, the housekeeping peptide can be embedded in a translation product of at least about 
60 amino acids. In other embodiments the housekeeping peptide can be embedded in a translation 
product of at least about 50, 30, or 15 amino acids. 

15 Due to differential proteasomal processing, the immune protcasome of the pAPC produces 

peptides tiiat are different from those produced by the housekeeping proteasome in peripheral body 
cells. Thus, in expressing a housekeeping peptide in the context of a larger protein, it is preferably 
expressed in the APC in a context other than its full length native sequence, because, as a 
housekeeping epitope, it is generally only efficiently processed from the native protein by the 

20 housekeeping proteasome, which is not active in the APC. In order to encode the housekeeping 
epitope in a DNA sequence encoding a larger protein, it is useful to find flanking areas on either 
side of the sequence encoding the epitope that permit appropriate cleavage by the immune 
proteasome in order to liberate that housekeeping epitope. Such a sequence ensuring epitope 
synchronization is referred to hereinafter as a SYNCHROTOPE™. Altering flankmg amino acid 

25 residues at the N-terminus and C-tenninus of the desired housekeeping epitope can facilitate 
appropriate cleavage and generation of the housekeeping epitope in the APC. Sequences 
embedding housekeeping epitopes can be designed de novo and screened to determine which can 
be successfully processed by immune proteasoraes to liberate housekeepmg epitopes. 

Alternatively, another strategy is very effective for identifying sequences allowing 

30 production of housekeeping epitopes in APC. A contiguous sequence of amino acids can be 
gMierated from head to tail arrangement of one or more housekeeping epitqws. A construct 
expressing this sequence is used to immunize an animal, and the resulting T cell response is 
evaluated to determine its specificity to one or more of the epitopes in the array. By definition, 
these iimnune responses indicate housekeeping epitopes that are processed in the pAPC effectively. 

35 The necessary flanking areas around this epitope are thereby defined. The use of flanking regions 

of about 4-6 amino acids on either side of the desired peptide can provide the necessary 
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information to facilitate proteasome processing of the housekeeping epitope by the immune 
proteasome. Therefore, a SYNCHROTOPE™ of approximately 16-22 ammo acids can be inserted 
into, or fused to, any protein sequence effectively to result in that housekeeping epitope being 
produced in an APC. hi alternate embodiments the whole head-to-tail array of epitopes, or just the 
epitopes immediately adjacent to the correctly processed housekeeping epitope can be similarly 
transferred from a test construct to a vaccine vector. 

In a preferred embodiment, the housekeeping epitopes can be embedded between known 
immune epitopes, or segments of such, thereby providing an appropriate context for processing. 
The abutment of housekeeping and immune qjitopes can generate the necessary context to enable 
the immune proteasome to liberate the housekeeping epitope, or a larger fragment, preferably 
including a correct C-terminus. It can be useful to screen constructs to verify that the desired 
epitope is produced. The abutment of housekeeping epitopes can generate a site cleavable by the 
immune proteasome. Some embodiments of the invention employ known epitopes to flank 
housekeeping epitopes in test substrates; in others, screening as described below arc used whe&er 
the flanking regions are arbitrary sequences or mutants of the natural flanking sequence, and 
whether or not knowledge of proteasomal cleavage preferences are used in designing the 
substrates. 

Cleavage at the mature N-terminus of the epitope, while advantageous, is not required, 
since a variety of N-terminal trimming activities exist in the cell that can generate the mature N- 
terminus of the epitope subsequent to proteasomal processing. It is preferred that such N-tenninal 
extension be less than about 25 amino acids in length and it is further preferred that the extension 
have few or no proline residues. Preferably, in screening, ccmsideration is given not only to 
cleavage at the ends of the epitope (or at least at its C-terminus), but consideration also can be 
given to ensure limited cleavage within the epitope. 

Shotgun approaches can be used in designing test substrates and can increase the efficiency 
of screening. In one embodiment multiple epitopes can be assembled one after the other, with 
individual epitopes possibly appearing more than once. The substrate can be screened to determine 
which epitopes can be produced. In the case where a particular epitope is of concern a substrate can 
be designed in which it appears in multiple different contexts. When a single epitope appearing in 
more than one context is liberated from the substrate additional secondary test substrates, in which 
individual instances of the epitope are removed, disabled, or are unique, can be used to determine 
which are being liberated and truly constitute SYNCHROTOPE™s. 

Several readily practicable screens exist A preferred in vitro screen utilizes proteasomal 
digestion analysis, using purified immune proteasomes, to determine if the desired housekeeping 
epitope can be liberated from a synthetic peptide embodying the sequence in question. The position 
of the cleavages obtained can be determined by techniques such as mass spectrometry, HPLC, and 
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N-terminal pool sequencing; as described in greater detail in U. S. Patent Applications entitled 
METHOD OF EPITOPE DISCOVERY, EPITOPE SYNCmONIZATION IN ANHGEN 
PRESENTING CELLS, two Provisional U. S. Patent Applications entitled EPITOPE 
SEQUENCES, which are all cited and incorporated by reference above. 
5 Alternatively, in vivo screens such as immunization or target sensitization can be 

employed For immunization a nucleic acid construct capable of expressing the sequence in 
question is used. Harvested CIL can be tested for their ability to recognize target cells presenting 
tiie housekeeping epitope in question. Such targets cells are most readily obtained by pulsing cells 
expressing the appropriate MHC molecule with synthetic peptide embodying the mature 

10 housekeeping epitope. Alternatively, cells known to express housekeeping proteasome and the 

antigen from which the housekeeping epitope is derived, either endogenously or through genetic 
engineering, can be used To use target sensitization as a screen, CTL, or preferably a CTL clone, 
that recognizes the housekeeping epitope can be used In this case it is the target cell that expresses 
the embedded housekeeping epitope (instead of the pAPC during immunization) and it must 

15 express immune proteasome. Generally, the target cell can be transformed wifli an appropriate 
nucleic acid construct to confer expression of the embedded housekeeping epitope. Loading with a 
synthetic peptide embodying the embedded epitope using peptide loaded liposomes or a protein 
transfer reagent such as BIOPORTER*^ (Gene Therapy Systems, San Diego, CA) represents an 
alternative. 

20 Additional guidance on nucleic acid constructs useful as vaccines in accordance with the 

present invention are disclosed in U.S. Patent Application No. 09/561,572 entitled "EXPRESSION 
VECTORS ENCODING EPITOPES OF TARGET-ASSOCIATED ANTIGENS," filed on April 
28, 2000. Further, expression vectors and methods for their design, which are useful in accordance 
with the present invention are disclosed in U.S. Patent Application No. 60/336,968 (attorney 

25 docket number CTLIMM.022PR) entitled "EXPRESSION VECTORS ENCODING EPITOPES 

OF TARGET-ASSOCIATED ANTIGENS AND METHODS FOR THEIR DESIGN," filed on 
1 1/7/2001, which is incorporated by reference in its entirety. 

A preferred embodiment of the present invention includes a method of administering a 
vaccine including an epitope (or epitopes) to induce a therapeutic immime response. The vaccine 

30 is administered to a patient in a manner consistent with the standard vaccine delivery protocols that 

are known in the art Methods of administering epitopes of TAAs including, without limitation, 
transdermal, intranodal, perinodal, oral, intravenoxis, intradermal, intramuscular, intraperitoneal, 
and mucosal administration, including delivery by injection, instillation or inhalatiorL A 
particularly useful meHiod of vaccine ^livery to elicit a CTL response is disclosed in Australian 

35 Patent No. 739189 issued January 17, 2002; U.S. Patent Application No. 09/380,534, filed on 
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September 1, 1999; and a Continuation-in-Part thereof U.S. Patent Application No. Q9ni6,lZl 
both entitled "A METHOD OF INDUCING A CTL RESPONSE," filed on February 2, 2001. 
Reagents Recognizing Epitopes 

In another aspect of the invention, proteins with binding specificity for the epitope and tiie 
epitope-MHC molecule con:q)lex are contenq}late4 as well as the isolated cells by which they can 
be expressed, hi one set of embodiments these reagents take the form of immunoglobulins: 
polyclonal sera or monoclonal antibodies (mAb), methods for the generation of which are well 
know in the art. Generation of mAb with specificity for peptide-MHC molecule complexes is 
known in the art. See, for example, Aharoni et al. Nature 351:147-150, 1991; Andersen et al. Proc 
Natl Acad, ScL USA 93:1820-1824, 1996; Dadaglio et al. Immunity 6:727-732, 1997; Due et al. 
Int Immunol 5:427-431,1993; Eastman et al. Eur, J, Imtnunol 26:385-393, 1996; Engberg et al. 
Immunotechnology 4:273-278, 1999; Porgdor et al. Imnmnity 6:715-726, 1997; Puri et al. J. 
Immunol 158:2471-2476, 1997; and Polakova, K., et al. J. Immunol. 165 342-348, 2000; all of 
which are hereby incorporated by reference in their entirety. 

In other embodiments the compositions can be used to induce and generate, in vivo and in 
vitro, T-cells specific for the any of the epitopes, including those listed in Table 1 A, for example. 
Thus, embodiments also relate to and include isolated T cells, T cell clones, T cell hybridomas, or a 
protein containing the T cell receptor (TCR) binding domain derived fi-om the cloned gene, as well 
as a recombinant cell expressing such a protein. Such TCR derived proteins can be simply the 
extra-cellular domains of the TCR, or a fusion with portions of anotiier protein to confer a desired 
property or function. One example of such a fusion is the attachment of TCR binding domains to 
the constant regions of an antibody molecule so as to create a divalent molecule. The construction 
and activity of molecules following this general pattern have been reported, for example, Plaksin, 
D. et al. /. Immunol 158:2218-2227, 1997 and Lebowitz, M.S. et al. CeU Immunol. 192:175-184, 
1999, which are hereby incorporated by reference in their entirety. The more general construction 
and use of such molecules is also treated in U.S. patent 5,830,755 entitled T CELL RECEPTORS 
AND THEIR USE IN THERAPEUTIC AND DIAGNOSHC METHODS, which is hereby 
incorporated by reference in its entirety. 

The generation of such T cells can be readily accomplished by standard inmiunization of 
laboratory animals, and reactivity to human target cells can be obtained by immunizing with human 
target cells or by immuniang HLA-transgenic animals with the antigen/epitope. For some 
therapeutic approaches T cells derived from the same species are desirable. While such a cell can 
be created by cloning, for example, a murine TCR into a human T cell as contemplated above, in 
vitro immunization of human cells offers a potentially faster option. Techniques for in vitro 
immunization, even using naive donors, are know in the field, for example, Stauss et al, Proc, 
Natl Acad, ScL USA 89:7871-7875, 1992; Salgaller et al. Cancer Res. 55:4972-4979, 1995; Tsai et 
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al., J. Invmunol 158:1796-1802, 1997; and Chung et al., /. Immunother. 22:279-287, 1999; which 
are hereby incoiporated by reference in their entirety. 

Any of diese molecules can be conjugated to enzymes, radiochemicals, fluorescent tags, 
and toxins, so as to be used in the diagnosis (imaging or other detection), monitoring, and treatment 
5 of tiie pathogenic condition associated with the epitope. Thus a toxin conjugate can be 

administered to kill tumor cells, radiolabeling can facilitate imaging of epitope positive tumor, an 
enzyme conjugate can be used in an ELISA-like assay to diagnose cancer and confirm epitope 
expression in biopsied tissue. In a further embodiment, such T cells as set forth above, following 
expansion accomplished through stimulation with the epitope and/or cytokines, can be 

10 administered to a patient as an adoptive immunotherapy. 

Reagents Comprising Epitopes 

A iurdier aspect of the invention provides isolated epitope-MHC complexes. In a 
particularly advantageous embodiment of this aspect of the invention, the complexes can be 
soluble, multimeric proteins such as those described in U. S. Patent No. 5,635,363 (tetramers) or U. 

15 S. Patent No. 6,015,884 (Ig-dimers), both of which are hereby incorporated by reference in their 

entirety. Such reagents are useful in detecting and monitoring specific T cell responses, and in 
purifying such T cells. 

Isolated MHC molecules complexed with epitopic peptides can also be incoiporated mto 
planar lipid bilayers or liposomes. Such compositions can be used to stimulate T cells in vitro or, in 

20 the case of liposomes, in vivo. Co-stimulatory molecules (e.g. B7, CD40, LFA-3) can be 

incorporated into ttit same con^sitions or, especially for in vitro work, co-stimulation can be 
provided by anti-K^o-receptor antibodies (e.g. anti*CD28, anti-CD154, anti-CD2) or cytokines (e.g. 
IL-2, IL-12). Such stimulation of T cells can constitute vaccination, drive expansion of T cells in 
vitro for subsequent infusion in an immuotherapy, or constitute a step in an assay of T cell 

25 function. 

The epitope, or more directly its complex with an MHC molecule, can be an important 
constituent of functional assays of antigen-specific T cells at either an activation or readout step or 
both. Of the many assays of T cell function current in the art (detailed procedures can be found in 
standard immunological references such as Current Protocols in Immunology 1999 John Wiley & 

30 Sons Inc., N.Y., which is hereby incorporated by reference in its entirety) two broad classes can be 

defined, those that measure the response of a pool of cells and those that measure the response of 
individual cells. Whereas the former conveys a global measure of the strength of a response, the 
latter allows determination of the relative frequency of responding cells. Examples of assays 
measuring global response are cytotoxicity assays, ELISA, and proliferation assays detecting 

35 cytokine secretion. Assays measuring the responses of individual cells (or small clones derived 

from them) include limiting dilution analysis (LDA), ELISPOT, flow cytometric detection of 
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unsecreted cytokine (described in U.S. Patent No. 5,445,939, entitled "METHOD FOR 
ASSESSMENT OF THE MONONUCLEAR LEUKOCYTE IMMUNE SYSTEM" and U.S. Patent 
Nos 5,656,446; and 5,843,689, both entitled '"METHOD FOR THE ASSESSMENT OF THE 
MONONUCLEAR LEUKOCYTE IMMUNE SYSTEM," reagents for wiiich are sold by Becton, 
5 Dickinson & Company under the tradename TASTIMMUNE', -which patents are hereby 
incorporated by reference in their entirety) and detection of specific TCR with tetramers or Ig- 
dimers as stated and referenced above. The comparative virtues of these techniques have been 
reviewed in Yee, C. et al. Current Opinion in Immunology, 13:141-146, 2001, which is hereby 
incorporated by reference in its entirety. Additionally detection of a specific TCR rearrangement 

10 or expression can be accomplished through a variety of established nucleic acid based techniques, 

particularly in situ and single-cell PCR techniques, as will be apparent to one of skill in the art. 

These fiinctional assays are used to assess endogenous levels of immunity, response to an 
immunologic stimulus (e.g. a vaccine), and to monitor immune status through the course of a 
disease and treatment. Except when measuring endogenous levels of immunity, any of these assays 

1 5 presume a preliminary step of immunization, whether in vivo or in vitro depending on the nature of 

the issue being addressed. Such immunization can be carried out with the various embodiments of 
the invention described above or with other forms of immunogen (e.g., pAFC-tumor cell fusions) 
that can provoke similar immunity. With the exception of PCR and tetramer/Ig-dimer type 
analyses which can detect expression of the cognate TCR, these assays generally benefit from a 

20 step of in vitro antigenic stimulation which can advantageously use various embodiments of the 

invention as described above in order to detect the particular functional activity (highly cytolytic 
responses can sometimes be detected directly). Finally, detection of cytolytic activity requires 
^itope-displaying target cells, which can be generated using various embodiments of the 
invention. The particular embodiment chosen for any particular step depends on the question to be 

25 addressed, ease of use, cost, and the like, but the advantages of one embodiment over another for 
any particular set of circumstances will be apparent to one of skill in the art 
Tumor Associated Antigens 

Epitopes of the present invention are derived from the TuAAs tyrosinase (SEQ ID NO. 2), 
SSX-2, (SEQ ID NO. 3), PSMA (prostate-specific membrane antigen) (SEQ ID NO. 4), GPlOO, 

30 (SEQ ID NO. 70), MAGE-1, (SEQ ID NO. 71), MAGE-2, (SEQ ID NO. 72), MAGE-3, (SEQ ID 

NO, 73), NY-ESO^l, (SEQ ID NO. 74), FRAME, (SEQ ID NO. 77), PSA, (SEQ ID NO. 78), and 
PSCA, (SEQ BD NO. 79). The natural coding sequences for these eleven proteins, or any segments 
within them, can be determined from their cDNA or complete coding (cds) sequences, SEQ ID 
NOS. 5-7, and 80-87, respectively. 

35 Tyrosinase is a melanin biosynthetic enzyme that is considered one of the most specific 

markers of melanocytic differentiation. Tyrosinase is expressed in few cell types, primarily in 
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melanocytes, and high levels are often found in melanomas. The usefulness of tyrosinase as a 
TuAA is taught in U.S. Patent 5,747,271 entitled "METHOD FOR IDENTIFYING INDIVIDUALS 
SUFFERING FROM A CELLULAR ABNORMALITY SOME OF WHOSE ABNORMAL 
CELLS PRESENT COMPLEXES OF HLA.A2n^0SINASE DERIVED PEPTIDES, AND 
MEraODS FOR TREATING SAID INDIVIDUALS" which is hereby incorporated by reference 
in its entirety, 

GPlOO, also known as PMell7, also is a melanin biosynthetic protein expressed at high 
levels in melanomas, GPlOO as a TuAA is disclosed in U.S. Patent 5,844,075 entitled 
"MELANOMA ANTIGENS AND THEIR USE IN DLVGNOSTIC AND THERAPEUTIC 
METHODS " which is hereby incorporated by reference in its entirety. 

SSX-2, also know as Hom-Mel-40, is a member of a femily of highly c<mserved cancer- 
testis antigens (Gure, A.O. et al. Int. J. Cancer 72:965-971, 1997, which is hereby incorporated by 
reference in its entirety). Its identification as a TuAA is taugjit in U.S. Patent 6,025,191 entitled 
"ISOLATED NUCLEIC ACID MOLECULES WHICH ENCODE A MELANOMA SPECIFIC 
ANTIGEN AND USES THEREOF," which is hereby incorporated by reference in its entirety. 
Cancer-testis antigens are found in a variety of tumors, but are generally absent from normal aduh 
tissues except testis. Expression of different members of the SSX family have been found 
variously in tumor cell lines. Due to the high degree of sequence identity among SSX family 
members, similar epitopes from more than one member of the femily will be generated and able to 
bind to an MHC molecule, so that some vaccines directed against one member of this femily can 
cross-react and be effective against other members of this family (see example 3 below). 

MAGE-1, MAGE-2, and MAGE-3 are members of anoflier family of cancer-testis antigens 
originally discovered in melanoma (MAGE is a contraction of melanoma-associated antigen) but 
found in a variety of tumors. The identification of MAGE proteins as TuAAs is taught in U.S. 
Patent 5,342,774 entitled NUCLEOTIDE SEQUENCE ENCODC^G THE TUMOR REJECTION 
ANTIGEN PRECURSOR, MAGE-I, which is hereby incorporated by reference in its entirety, and 
in numerous subsequent patents. Currently there are 17 entries for (human) MAGE in the SWISS 
Protein database. There is extensive similarity among these proteins so in many cases, an epitope 
from one can induce a cross-reactive response to other members of ttie family. A few of these have 
not been observed in tumors, most notably MAGE-Hl and MAGE-Dl, which are expressed in 
testes and brain, and bone marrow stromal cells, respectively. The possibility of cross-reactivity on 
normal tissue is ameliorated by the fact that they are among the least similar to the other MAGE 
proteins. 

NY-ESO-1, is a cancer-testis antigen found in a wide variety of tumors, also known as 
CTAG-1 (Cancer-Testis Antigen-l) and CAG-3 (Cancer Antigen-3). NY-ESO-1 as a TuAA is 
disclosed in U.S. Patent 5,804,381 entitled ISOLATED NUCLEIC ACID MOLECULE 
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ENCODING AN ESOPHAGEAL CANCER ASSOCIATED ANTIGEN, THE ANTIGEN ITSELF. 
AND USES TEffiREOF which is hereby incorporated by reference in its entirety. A paralogous 
locus encoding antigens with extensive sequence identity, LAGE-la/s (SEQ ID NO. 75) and 
LAGE-lb/L (SEQ ID NO. 76), have been disclosed in publicly available assemblies of the human 
5 genome , and have been concluded to arise through alternate splicing. Additionally, CT-2 (or 

CTAG-2, Cancer-Testis Antigen-2) appears to be either an allele, a mutant, or a sequencing 
discrepancy of LAGE-lb/L. Due to the extensive sequence identity, many epitopes fromNY-ESO- 
1 can also induce immunity to tumors expressing these other antigens. See figure I. The proteins 
are virtually identical through amino acid 70. From 71-134 the longest run of identities between 

10 NY-ESO-1 and LAGE is 6 residues, but potentially cross-reactive sequences are present And 

from 135-180 NY-ESO and LAGE-la/s are identical except for a smgle residue, but LAGE-lb/L is 
unrelated due to the alternate splice. The CAMEL and LAGE-2 antigens appear to derive from the 
LAGE-1 mRNA, but fitjm alternate reading frames, thus giving rise to unrelated protein sequences. 
More recently, GenBank Accession AF277315,5, Homo sapiens chromosome X clone RP5- 

15 865E18, RP5-1087L19, complete sequence, reports tfiree independent loci in this region which are 

labeled as LAGEl (corresponding to CTAG-2 in the genome assemblies), plus LAGE2-A and 
LAGE2-B (both corresponding to CTAG-1 in the genome assemblies). 

PSMA (prostate-specific membranes antigen), a TuAA described in U.S. Patent 5,538,866 
entitled ^TROSTATE-SPECIFIC MEMBRANES ANTIGEN" which is hereby incorporated by 

20 reference in its entirety, is expressed by normal prostate epithelium and, at a higher level, in 

prostatic cancer, ft has also been found in flie neovasculature of non-prostatic tumors. PSMA can 
thus form the basis for vaccines directed to both prostate cancer and to the neovasculature of other 
tumors. This later concept is more fully described in a provisional U.S. Patent application No. 
60/274,063 entitled ANTI-NEOVASCULAR VACCINES FOR CANCER, filed March 7, 2001, 

25 and U.S. Application No. / attorney docket number CTLIMM.015A, filed on March 7, 

2002, entitled "ANTI-NEOVASCULAR PREPARATIONS FOR CANCER," both of which are 
hereby incorporated by reference in their entirety. Alternate splicing of the PSMA mRNA also 
leads to a protein with an apparent start at Metjs, toeby deleting the putative membrane anchor 
region of PSMA as described in U.S. Patent 5,935,818 entitled "ISOLATED NUCLEIC ACID 

30 MOLECULE ENCODING ALTERNATIVELY SPLICED PROSTATE-SPECIHC 

MEMBRANES ANTIGEN A>n> USES THEREOF' which is hereby incorporated by reference in 
its entirety. A protein termed PSMA-like protein, Genbank accession number AF261715, is nearly 
identical to amino acids 309-750 of PSMA and has a different expression profile. Thus the most 
preferred epitopes are those with an N-terminus located fi-om amino acid 58 to 308. 

35 PRAME, also know as MAPE, DAGE, and 0IP4, was originally observed as a melanoma 

antigen. Subsequently, it has been recognized as a CT antigen, but unlike many CT antigens (e.g., 
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MAGE, GAGE, and BAGE) it is expressed in acute myeloid leukemias. FRAME is a member of 
the MATE family which consists largely of hypothetical proteins with which it shares limited 
sequence similarity. The usefulness of FRAME as a TuAA is taught in U.S. Patent 5,830,753 
entitled "ISOLATED NUCLEIC ACID MOLECULES CODING FOR TUMOR REJECTION 
5 ANTIGEN PRECURSOR DAGE AND USES THEREOF" which is hereby incorporated by 

reference in its entirety. 

PSA, prostate specific antigen, is a peptidase of the kallikrein family and a differentiation 
antigen of the prostate. Expression in breast tissue has also been reported. Alternate names include 
gamma-seminoprotcin, kallikrein 3, seminogelase, seminin, and P-30 antigen. PSA has a high 
10 degree of sequence identity with the various alternate splicing products prostatic/glandular 

kallikrein-l and -2, as weD as kalikrem 4, which is also expressed in prostate and breast tissue. 
Other kallikreins generally share less sequence identity and have different expression profiles. 
Nonetheless, cross-reactivity that might be provoked by any particular epitope, along with the 
likelihood that that epitope would be liberated by processing in non-target tissues (most generally 
15 by the housekeeping proteasome), should be considered in designing a vaccine. 

PSCA, prostate stem cell antigen, and also known as SCAH-2, is a differentiation antigen 
preferentially expressed in prostate epithelial cells, and overexpresssed in prostate cancers. Lower 
level expression is seen in some normal tissues including neuroendocrine cells of the digestive tract 
and collecting ducts of the kidney. PSCA is described in U.S. Patent 5,856,136 entitled **HUMAN 
20 STEM CELL ANTIGENS'* which is hereby incorporated by reference in its entirety. 

Synaptonemal conoplex protein 1 (SCP-1), also known as HOM-TES-14, is a mciosis- 
associated protein and also a canccr-testis antigen (Tureci, O., et al. Proc. Natl. Acad. Sci. USA 
95:5211-5216, 1998). As a cancer antigen its expression is not cell-cycle regulated and it is found 
frequently in gliomas, breast, renal cell, and ovarian carcinomas. It has some similarity to myosins, 
25 but with few enough identities that cross-reactive epitopes are not an immediate prospect. 

The ED-B domain of fibronectin is also a potential target Fibronectin is subject to 
developmentally regulated alternative splicing, with the ED-B domain being encoded by a single 
exon that is used primarily in oncofetal tissues (Matsuura, H, and S. Hakomori Proc Natl Acad. 
ScL USA 82:6517-6521, 1985; Camemolla. B. et al. J, Cell Biol 108:1139-1148, 1989; London- 
30 Rosa, B. et al. Cancer /?ey.50:1608-1612, 1990; Nicolo, G. et al. Cell Differ, Dev. 32:401-408, 

1990; Borsi,L.etal, J2g>. CellRes. 199:98-105, 1992; Oyama, F. et aL Cancer Res. 53:2005-2011, 
1993; Mandel, U. et al. APMIS 102:695-702, 1994; Famoud, MR. et al. Int. 7, Cancer 61:27-34, 
1995; Pujuguet, P. et al. Am. J. Pathol. 148:579-592, 1996; Gabler, U. et al. Heart 75:358-362, 
1996;Chevalier, X. Br. J. Rlieufnatol 35:407-415, 1996; Midulla, M. Cancer Res. 60:164-169, 
35 2000), 
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The ED-B domain is also expressed in fibronectin of the neovasculature (Kaczmarek, J. et 
al. Int y. Ccaicer 59:11-16, 1994; Castellani, P. et al. Int J. Cancer 59:612-618, 1994; Neri, D. et 
al. Nat Biotech. 15:1271-1275, 1997; Karelina, T.V. and hX. Eisen Cancer Detect Prev. 22:438- 
444, 1998; Tarli, L. et al Blood 94:192-198, 1999; Castellani, P. et al. Acta Neurochir, (Wieri) 
5 142:277-282, 2000). As an oncofetal domain, the ED-B domain is commonly found in the 

fibronectin expressed by neoplastic cells in addition to being expressed by the neovasculature. 
Thus, CTL-inducing vaccines targeting the ED-B domain can exhibit two mechanisms of action: 
direct lysis of tumor cells, and disruption of the tumor's blood supply through destruction of the 
tumor-associated neovasculature. As CTL activity can decay rapidly after withdrawal of vaccine, 

10 interference with normal angiogenesis can be minimal. The design and testing of vaccines targeted 

to neovasculature is described in Provisional U.S. Patent Application No. 60/274,063 entitled 
"ANn-NEOVASCXJLATURE VACCINES FOR CANCER" and in U.S. Patent Application No, 
/ attorney docket number CTLIMM.015A, entided "ANTI-NEOVASCULATURE 

PREPARATIONS FOR CANCER, filed on date even wife this application (March 7, 2002), A 

15 tumor cell line is disclosed in Provisional U.S. Application No. _/ , filed on March 7, 2002, 

attorney docket number CTLIMM.028PR, entitled 'TCLA-TRANSGENIC MURINE TUMOR 
CELL LINE," which is hereby incorporated by reference in its entirety. 

Carcinoembiyonic antigen (CEA) is a paradigmatic oncofetal protein first described in 
1965 (Gold and Freedman, J. Exp. Med. 121: 439-462, 1965, Fuller references can be found in the 

20 Online Medelian Inheritance in Man; record * 114890). It has officially been renamed 
carcinoembiyonic antigen-related cell adhesion naolecule 5 (CEACAM5). Its expression is most 
strongly associated with adenocarcinomas of the epithelial lining of the digestive tract and in fetal 
colon. CEA is a member of the immunoglobulin supergene family and the defining member of the 
CEA subfamily. 

25 HER2/NEU is an oncogene related to the epidermal growth factor receptor (van de Vijver. 

et al., New Eng. J, Med, 319:1239-1245, 1988), and apparently identical to the C-ERBB2 oncogene 
(Di Fiore, et al.. Science 237: 178-182, 1987). The over-expression of ERBB2 has been implicated 
in the neoplastic transformation of prostate cancer. As HER2 it is amplified and over-expressed in 
25-30% of breast cancers among other tumors where expression level is correlated with the 

30 aggressiveness of die tumor (Slamon, et al., New Eng. J. Med. 344:783-792, 2001). A more detailed 

description is available in the Online Medelian Inheritance in Man; record * 164870. 

All references mentioned herein are hereby incorporated by reference in their entirety. 
Further, incorporated by reference in its entirety is U.S. Patent Application No. 10/005,905 
(attorney docket number CTLIMM.021CPI) entitled "EPITOPE SYNCHRONIZATION IN 

35 ANTIGEN PRESENTING CELLS," filed on November 7, 2001 and a continuation thereof, U.S. 

Application No. _/ , filed on December 7, 2000, attorney docket number 

-42- 



wo 2003/008537 



PCTAJS2002/010189 



CTLIMM.21CPIC, also entitled "EPITOPE SYNCHRONIZATION IN ANTIGEN PRESENTING 
CELLS." 

Useful epitopes were identified and tested as described in the following exanq>les. 
However, these examples are intended for illustration purposes only, and should not be construed 
5 as limiting Ifae scope of the inventicKi in any way. 

EXAMPLES 

Sequences of Specific Prefe rred Epitopes 
Example 1 

10 Manufacture of tyrosinase epitopes. 

A. Synthetic production of epitopes 

Peptides having an amino acid sequence of any of SEQ ID NO: 1, 8, 9, 11-23, 2-29, 32-44, 
47-54, 56-63, 66-68 88-253, or 256-588 are synthesized using either FMOC or tBOC solid phase 
synthesis methodologies. After synthesis, the peptides are cleaved from flieir supports with either 

15 trifluoroacetic acid or hydrogen fluoride, respectively, in the presence of appropriate protective 

scavengers. After removing the acid by evaporation, the peptides are extracted with ether to 
remove the scavengers and the crude, precipitated peptide is then lyophilized. Purity of the crude 
peptides is determined by HPLC, sequence analysis, amino acid analysis, counterion content 
analysis and other suitable means. If the crude peptides are pure enough (greater than or equal to 

20 about 90% pure), they can be used as is. If purification is required to meet drug substance 

specifications, the peptides are purified usmg one or a combination of the following: re- 
precipitation; reverse-phase, ion exchange, size exclusion or hydrophobic interaction 
chromatography; or counter-current distribution. 
Drug product formulation 

25 GMP-grade peptides are formulated in a parenterally acceptable aqueous, organic, or 

aqueous-organic buffer or solvent system in which they remain both physically and chemically 
stable and biologically potent Generally, buffers or combinations of buffers or combinations of 
buffers and organic solvents are appropriate. The pH range is typically between 6 and 9. Organic 
modifiers or other excipients can be added to help solubilize and stabilize tiie peptides. These 

30 include det«[gents, lipids, co-solvents, antioxidants, chelators and reducing agents. In the case of a 

lyophilized product, sucrose or mannitol or otho: lyophilization aids can be added. Peptide 
solutions are sterilized by membrane filtration into their final container-closure system and either 
lyophilized for dissolution in the clinic, or stored until use. 

B. Construction of expression vectors for use as nucleic acid vaccines 

35 The construction of three generic epitope expression vectors is presented below. The 

particular advantages of these designs are set forth in U.S. Patent Application No. 09/561,572 
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entitled •'EXPRESSION VECTORS ENCODING EPITOPES OF TARGET-ASSOOATED 
ANTIGENS,*' which has heen incorporated by reference in its entirety above. 

A suitable E. coli strain was ^en transfected with the plasmid and plated out onto a 
selective medium. Several colonies were grown up in suspension culture and positive clones were 
identified by restriction mapping. The positive clone was then grown up and aliquotted into 
storage vials and stored at -70*'C. 

A mini-prep (QIAprep Spin Mini-prep: Qiagen, Valencia, CA) of the plasmid was then 
made from a sample of these cells and automated fluorescent dideoxy sequence analysis was used 
to confirm that the constmct had tfic desired sequence. 

B.l Construction of pVAX-EPl-IRES-EP2 
Overview: 

The starting plasmid for this construct is pVAXl purchased from Ihvitrogen (Carlsbad, 
CA). Epitopes EPl and EP2 were synthesized by GIBCO BRL (Rockville, MD). The IRES was 
excised from pIRES purchased from Clontech (Palo Alto, CA). 
Procedure: 

1 pIRES was digested with EcoRI and NotL The digested fragments were separated by 
agarose gel electrophoresis, and the IRES fragment was purified from the excised band. 

2 pVAXl was digested with EcoRI and NotI, and the pVAXl fragment was gel-purified. 

3 The purified pVAXl and IRES fragments were then ligated toge&er. 

4 Con^tent E. coli of strain DH5a were transformed with the ligation mixture. 

5 Minipreps were made from 4 of the resultant colonies. 

6 Restriction enzyme digestion analysis was performed on die miniprep DNA. One 
recombinant colony havmg the IRES insert was used for further insertion of EPl and EP2. 
This intermediate construct was called pVAX-IRES. 

7 Oligonucleotides encoding EPl and EP2 were synthesized. 

8 EPl was subcloned into pVAX-IRES between Afin and EcoRI sites, to make pVAX- 
EPl-IRES; 

9 EP2 was subcloned into pVAX-EPl-IRES between Sail and NotI sites, to make the 
final construct pVAX-EPl-IRES-EP2. 

10 The sequence of the EP 1-IRES-EP2 insert was confirmed by DNA sequencing. 
B2. Construction of PVAX-EP1-IRES-EP2-ISS-NIS 

Overview: 

The starting plasmid for this construct was pVAX-EPl-IRES-EP2 (Example 1). The ISS 
(immunostimulatory sequence) introduced into this construct is AACGTT, and the NTS (standing 
for nuclear import sequence) used is the SV40 72bp repeat sequence. ISS-NIS was synthesized by 
GIBCO BRL. See Figure 2. 
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Procedure: 

1 pVAX-EPl-IRES-EP2 was digested with Nrul; the linearized plasmid was gel-purified. 

2 ISS-NIS oligonucleotide was synthesized. 

3 The purified linearized pVAX-EPl-IRES-EP2 and synthesized ISS-NIS were ligated 
togetiier. 

4 Competent £. coli of strain DHSa were transfonned with the ligation product 

5 Minipreps were made from resultant colonics. 

6 Restriction enzyme digestions of the minipreps were carried out. 

7 The plasmid with the insert was sequenced. 
B3. Construction of pVAX-EP2-UB-EPl 

Overview: 

The starting plasmid for this construct was pVAXl (Ihvitrogen). EP2 and EPl were 
synthesized by GIBCO BRL. Wild type Ubiquitin cDNA encoding the 76 amino acids in the 
construct was cloned from yeast. 
Procedure: 

1 RT-PCR was performed using yeast mRNA. Primers were designed to amplify the 
complete coding sequence of yeast Ubiquitin. 

2 The RT-PCR products were analyzed using agarose gel electrophoresis. A band with the 
predicted size was gel-purified. 

3 The purified DNA band was subcloned into pZEROl at EcoRV site. The resulting clone 
was named pZERO-UB. 

4 Several clones of pZERO-UB were sequenced to confirm the Ubiquitin sequence before 
further manipulations. 

5 EPl and EP2 were synthesized. 

6 EP2, Ubiquitin and EPl were ligated and the insert cloned into pVAXl between BamHI 
and EcoRI, putting it under control of die CMV promoter. 

7 The sequence of the insert EP2-UB-EP1 was confirmed by DNA sequencing. 
Example 2 

Identification of useful epitope variants. 

The lO-mo: FLPWHRLFLL (SEQ ID NO. 1) is identified as a useful epitope. Based on 
this sequence, numerous variants are made. Variants exhibiting activity in HLA binding assays (see 
Exan^)le 3, section 6) are identiSed as usefiil, and are subsequently incorporated into vaccines. 

The HLA-A2 binding of length variants of FLPWHRLFLL have been evaluated. 
Proteasomal digestion analysis indicates that the C-terminus of the 9-mer FLPWHRLFL (SEQ ID 
NO. 8) is also produced. Additionally the 9-mer LPWHRLFLL (SEQ ID NO. 9) can result fi-om N- 
terminal trimming of the 10-mer. Both are predicted to bind to the HLA-A*0201 molecule, 
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however of these two 9-mers, FLPWHRLFL displayed more significant binding and is preferred 
(see Figs. 3 A and B). 

Sequence variants of FLPWHRLFL are constructed as follow. Consistent with the binding 
coefficient table (see Table 3) fiom the NIH/BIMAS MHC binding prediction program (see 
5 reference in exan^le 3 below), binding can be in:q)roved by changing the L at position 9, an anchor 
position, to V. Binding can also be altered, though generally to a lesser extent, by changes at non- 
anchor positions. Referring generally to Table 3, binding can be increased by employing residues 
with relatively larger coefficients. Changes in sequence can also alter immunogenicity 
independently of their effect on binding to MHC. Thus binding and/or immunogenicity can be 
10 improved as follows: 

By substituting FJL>1,W, or Y for P at position 3; these are all bulkier residues that can 
also improve immimogMiicity independent of the effect on binding. The amine and hydroxyl- 
bearing residues, Q and N; and S and T; respectively, can also provoke a stronger, cross-reactive 
response. 

15 By substituting D or E for W at position 4 to improve binding; this addition of a negative 

charge can also make the epitope more immunogenic, while in some cases reducing cross-reactivity 
with the natural epitope. Alternatively the conservative substitutions of F or Y can provoke a 
cross-reactive response. 

By substituting F for H at position 5 to improve binding. H can be viewed as partially 

20 charged, thus in some cases Ihe loss of pharge can hinder cross-reactivity. Substitution of the fully 
charged residues R or K at this position can enhance immunogenici^ without disrupting charge- 
dependent cross-reactivity. 

By substituting I, L, M, V, F, W, or Y for R at position 6. The same caveats and 
alternatives 3pp\y here as at position 5. 

25 By substituting W or F for L at position 7 to improve binding. Substitution of V, I, S, T, Q, 

or N at this position are not generally predicted to reduce binding affinity by this model (the NIH 
algorithm), yet can be advantageous as discussed above. 

Y and W, which are equally preferred as the Fs at positions 1 and 8, can provoke a useful 
cross-reactivity. Finally, while substitutions in the direction of buDtiness are generally favored to 

30 improve immunogenicity, the substitution of smaller residues such as A, S, and C, at positions 3-7 
can be useful according to the tiieoty that contrast in size, rather than bulkiness per se, is an 
important factor in immunogenicity. The reactivity of the thiol group in C can mtroduce other 
properties as discussed in Chen, J.-L., et al. /. IimnunoL 165:948-955, 2000. 
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HLA Coefficient table for file "A„0201„standard" 


Amino Acid 
Type 


1^ 




3rd 


4th 


5th 


6th 


7th 


8th 


9th 


A 


1,000 


1.000 


1.000 


1.000 


1.000 


l.OOC 


1.000 


1.000 


1.00 


C 


1.000 


0.470 


1.000 


1.000 


1.000 


l.OOG 


l.OOO 


1.000 


1.00 


D 


0.075 


0.100 


0.400 


4.100 


1.000 


l.OOC 


0.490 


l.OOO 


0.00 


E 


0.075 


1.400 


0.064 


4.100 


1.000 


1.000 


0.490 


1,000 


0.00 


F 


4.600 


0.050 


3.700 


1.000 


3.800 


1.900 


5.800 


5.500 


0,01 


G 


1.000 


0.470 


1.000 


1.000 


1.000 


1,000 


0.130 


1.000 


0.01 


H 


0.034 


0.050 


1.000 


1.000 


1.000 


1.000 


1.000 


1.000 


0.01 


1 


1.700 


9.900 


1.000 


1.000 


1.000 


2,300 


1.000 


0.410 


2.10 


K 


3.50a 


0.100 


0.035 


1.000 


1.000 


1.000 


1.000 


1,000 


0.00 


L 


1.700 


72.000 


3.700 


1,000 


1.000 


2,300 


1.000 


1.000 


4.30 


M 


1.700 


52.000 


3.700 


1.000 


1.000 


2.300 


1.000 


1.000 


1.00 


N 


1.000 


0.470 


1,000 


1.000 


1.000 


1,000 


l.OOO 


l.OOO 


0.01 


P 


0.022 


0.470 


1.000 


1.000 


1.000 


1.000 


1.000 


1.000 


0.00 


Q 


i.ood 


7.300 


1.000 


1.000 


1.000 


1.000 


l.OOO 


1.000 


0.00 


R 


1.000 


0.010 


0.07^ 


1.000 


1.000 


1.000 


0.200 


1.000 


0.00 


S 


1.000 


0.470 


1.000 


1.00(^ 


1.000 


1.000 


1.000 


1,000 


0.01 


T 


1.000 


1.000 


1.000 


1.000 


1.000 


1.000 


1.000 


1.000 


1.50 


V 


1.700 


6.300 


1.000 


1.000 


1,000 


2.300 


l.OOO 


0.410 


14.00 


W 


4.600 


O.OlO 


8.300 


1. 000 


1.000 


1.700 


7.500 


5.500 


0.01 


Y 


4.600 


O.OlO 


3.200 


1.000 


l.OOO 


1.500 


1.000 


5.500 


0.01 



♦This table and other conqsarable data that are publicly available are useful in designing epitope 
variants and in determining whether a particular variant is substantially similar, or is functionally 
5 similar. 

Example 3 

Cluster Analysis (SSX-lsj-^a). 

1. Epitope cluster region prediction : 

The computer algorithms: SYFPEITHI (internet http:// access at sy^eithi.bmi- 

10 heidelberg.com/Scripts/MHCServer.dll/EpPredictJitm), based on the book "MHC Ligands and 
Peptide Motifs" by H.G.Rammensee, J-Bachmann and S.Stevanovic; and HIA Peptide Binding 
Predictions (NIH) (internet http*y/ access at bimas.dcrt.nih.gov/molbio/hIa_bin), described in 
Parker, K, C, et al., J, Immunol 152:163, 1994; were used to analyze the protein sequence of 
SSX-2 (GL10337583). Epitope clusters (regions with higher than average density of peptide 

15 fragments with high predicted MHC afSnity) were defined as described fully in U.S. Patent 

Application No. 09/561,571 entitled "EPITOPE CLUSTERS." filed on April 28, 2000. Using a 
epitope density ratio cutoff of 2, five and two clusters were defined using &e SYFPETHI and NIH 
algorithms, respectively, and peptides score cutoffs of 16 (SYFPETHI) and 5 (NIH). The highest 
scoring peptide with the NIH algorithm, SSX-24i^, with an estimated halftimc of dissociation of 
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>1000 min., does not overlap any ofcer predicted epitope but does cluster with SSX-2s«5 in 
KIH analysis. 

2. Peptide synthesis and characterization: 

SSX-2,w^ YFSKEEWEKMKASEKIFYVYMKRKYEAKmCLGFKATLP (SEQID NO. 
10) was synthesized by MPS (Multiple Peptide Systems. San Diego. CA 92121) using standard 
solid phase chemistry. Accoiding to the provided 'Certificate of Analysis', the purity of this 

peptide was 95%. 

3. Proteasome digestion : 

Proteasome was isolated from human red blood cells using the proteasome isolation 
protocol described in U.S. Patent Application No. 09/561.074 entitled "METHOD OF EPITOPE 
DISCOVERY," filed on April 28. 2000. SDS-PAGE. western-blotting, and EUSA were used as 
quality control assays. The final concentration of proteasome was 4 mg/ml. which was detenmined 
by non-interfering protein assay (Geno Technologies fiic). Proteasomes were stored at -70»C m 25 
id aliquots. 

SSX-23,.M was dissolved in Milli-Q water, and a 2 mM stock solution prepared and 20nL 

aliquots stored at -20°C. 

1 tube of proteasome (25 |iL) was removed from storage at-70''C and thawed on ice. It was 
liien mixed thorou^y with 12.5jiL of 2mM peptide by repipetting (samples were kept on ice). A 
5HL san^le was immediately removed after mixing and transferred to a tube containing 1.25jiL 
10%TFA (final concentration of TFA was 2%); the T=0 min sample. The proteasome digestion 
reaction was then started and carried out at ST'C in a programmable thermal controller. Additional 
SpL samples were taken out at 15. 30, 60, 120. 180 and 240 min respectively, the reaction was 
stopped by adding the sample to 1.25nL 10% TFA as before. Samples were kept on ice or fiozen 
until being analyzed by MALDI-MS. All samples were saved and stored at -20''C for HPLC 
analysis and N-terminal sequencing. Peptide alone (without proteasome) was used as a blank 
control: 2 \>L peptide + 4nL Tris buffer (20 mM, pH 7.6) + 1.5mL TFA. 
4. MAT.r)I-TOF MS measurements : 

For each time point 0.3 nL of matiix solution (lOmg/ml a-cyano-4-hydroxycinnaniic acid 
m AcCN/HjO (70:30)) was first applied on a sample slide, and flien an equal volume of digested 
sanq>le was mixed gentiy witii matrix solution rai flie slide. The slide was allowed to dry at 
ambient air for 3-5 min. before acquiring tiie mass spectra. MS was performed on a Lasermat 2000 
MALDI-TOF mass spectrometer that was calibrated with peptide/protein standards. To improve 
the accuracy of measurement, the molecular ion weight (MH*) of tiie peptide substrate was used as 
an internal calibration standard. The mass spectrum of the T= 120 min. digested sanqile is shown in 
figure 4. 
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5. MS data analysis and epitope identification : 

To assign the measured mass peaks, the computer program MS-Product, a tool from the 
UCSF Mass Spectrometry Facility (http:// accessible at 

prospector.ucsf.edu/ucs£htmB.4/insprodlitm), was used to generate aU possible fragments (N- and 
C-termmal ions, and internal fragments) and their corresponding molecular weigjits. Due to the 
sensitivity of the mass spectrometer, average molecular weight was used. The mass peaks 
observed over the course of the digestion were identified as summarized in Table 4, 

Fragments co-C-terminal with 8-10 amino acid long sequences predicted to bind HLA by 
the SYFPEmU or NIH algorithms were chosen for further study. The digestion and prediction 
steps of the procedure can be usefiiUy practiced in any order. Although the substrate pq}tide used 
in pioteasomal digest described here was specifically designed to include predicted HLA-A2.1 
binding sequences, the actual products of digestion can be checked after the fact for actual or 
predicted binding to other MHC molecules. Selected results are shown in Table 5. 
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Tflhle 4,SSX-2,t ^.Mass P ^ak Identification. 



MS PEAK 
^measured) 


PEPTIDE 


SEQUENCE 


calculated 
masscmhI 


988.23 


31-37 


YFSKEEW 


989.08 


1377.68±2.3 
8 


31-40 


YFSKEEWEKH 


1377.68 


1662.45±1.3 
0 


31-43 


YFSKEEWEKMKAS 


1663.90 


218l.72±0.8 
5 


31-47 


YFSKEEWEKMKiiSBKl 


2181.52 


2346.6 


31-48 


YFSKEEWEKMKASEKIFX 


2344.71 


l472.l6iL5 
4 


38-49 


EKMKASEKIFYV 


1473.77 


2445.78±L1 
8 


31-49* 


YFSKEEWEKMKASEKIPYV 


2443.84 


2607. 


31-50 


YFSKEEWEKMKASEKIPYVY 


2607.02 


1563.3 


50-61 


YMKRKYBAMTKL 


1562.93 


3989.9 


31-61 


YFSKEEVfEKMKASEKIFYVYMKRKYlSiUaritii 


3987.77 


1603.74±1.5 
3 


51-63 


MKRIOTAMTKLGF 


1603.98 


1766.45±L5 


50-63 


YMKRKX iSfVMTlviitair 


1767.16 


1866.32±1.2 
2 


49-63 


VYMKRKYEAMTKIiGF 


1866.29 


4192.6 


31-63 


YFSKEEWEKMKASEKI FYVYMKRKYEAMTKLG 
F 


4192.00 


4392,1 


31-65** 


■ YFSKBEWEKMKASEKIFYVYMKRKYEAMTKl^ 
FKA 


4391.25 



Boldface sequence correspona w pcpuucs, pi<=uivivv. . ~\ ,o <ft 

* On the basis of mass alone this peak could also have been assigned to the peptde 32-50, howy« 
proteasomal removal of just the N-temrinal amino acid is unlikely. N-termmal sequencmg (below) 

verifies the assignment to 31-49. ^ . , u^i^„, 

»* On the basis^mass this fiagment might also represent 33-68. N-terminal sequencmg below is 

c(»isistent wifii the assignment to 31-6S. 
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Tahlfe S. Pro^«<'t««i Tn.A Mndln^ by proteasomallv generated fragments 



SEOIDNO. 


PEPTIDE 


HLA 


SYFPEITHI 


NIH 


11 


FSKEEWEKM 


B*3501 


NPt 


90 


12 


KMKASEKIF 


B*08 


17 


<5 


13 & (14) 


(K) MKASEKIFY 


Al 


19(19) 


<5 


15 &{16) 


(M) KASEKIFYV 


A*0201 


22 (16) 


1017 


B*08 


17 


<5 


B*5101 


22(13) 


60 


B*5102 


M» 


133 


B*5103 


NP 


121 


17 & (18) 


(K)ASEKIFYVY 


Al 


34 (19) 


14 


19 & (20) 


(K) RKYEAMTKL 


A*0201 


15 


<5 


A26 


15 


NP 


B14 


NP 


45 (60) 


B*2705 


21 


15 


B*2709 


16 


NP 


B*5101 


15 


<5 


21 

22 
23 


KYEAMTKLGF 

YEAMTKLGF 
EAMTKLGF 


Al 


16 


<5 


A24 


NP 


300 


B*4403 


NP 


80 


B*08 


22 


<5 


No prediction 



5 

As seen in Table 5, N-tenninal addition of authentic sequence to epitopes can generate q)itopes for 
the same or different MHC restriction elements. Note in particular the pairing of 
(K)RKYEAMTKL (SEQ ID NOS 19 and (20)) with HLA-B14, where the 10-mer has a longer 

10 predicted halftime of dissociation than the co-C-terminal 9-mer. Also note the case of the 10-mer 

KYEAMTKLGF (SEQ JD NO. 21) which can be used as a vaccine useful with several MHC types 
by relying on N-terminal trimming to create the epitopes for HLA-B*4403 and -B*08. 
6. HLA-'A0201 binding assay : 

Binding of the candidate epitope KASEKIFYV, SSX-24M9, (SEQ ID NO. 15) to HLA- 

15 A2.1 was assayed using a modification of &e method of Stauss et al., (Proc Natl Acad Sci USA 
89(17):7871-5 (1992)). Specifically, T2 cells, which express empty or unstable MHC molecules 
on their surface, were washed twice with Iscove's modified Dulbecco's medium (IMDM) and 
cultured overnight in serum-fi-ee AIM-V medium (Life Technologies, Inc., Rockville, MD) 
supplemented with human B2-microglobulin at 3 fig/ml (Sigma, St Louis, MO) and added peptide, 
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at 800, 400, 200, 100, 50, 25, 12.5, and 6.25 ng/ml.in a 96-well flat-bottom plate at SxlO' cells/200 
Hl/well. Peptide was mixed with the cells by repipeting before distributing to the plate 
(alternatively peptide can be added to individual wells), and the plate was rocked gently for 2 
minutes. Incubation was in a 5% CQz incubator at 37**C. The next day Ihe unbound peptide was 
removed by washing twice with serum free RPMI medium and a saturatmg amount of anti-class I 
HLA monoclonal antibody, fluorescein isothiocyanate (FITC)-conjugated anti-HLA A2, A28 (One 
Lambda, Canoga Park, CA) was added. After incubation for 30 minutes at 4*^C, cells were washed 
3 times with PBS supplemented with 0.5% BSA, 0.05%(w/v) sodium azide, pH 7.4-7.6 (staining 
buffer). (Altematively W6/32 (Sigma) can be used as the anti-class I HLA monoclonal antibody 
the cells washed witii staining buffer and then incubated with fluorescein isothiocyanate (FITC)- 
conjugated goat F(ab') antimouse-IgG (Sigma) for 30 min at 4^ and washed 3 times as before.) 
The cells were resuspended in 0.5 ml staining buffer. The analysis of surface HLA-A2.1 molecules 
stabilized by peptide binding was performed by flow cytometry using a FACScan (Becton 
Dickinson, San Jose, CA). If flow cytometry is not to be performed immediately the cells can be 
fixed by adding a quarter volume of 2% paraformaldehyde and storing in the dark at 4 C, 

The results of the experiment are shown in Figure 5. SSX-241-49 (SEQ ID NO. 15) was 
found to bind HLA-A2.1 to a similar extent as the known A2.1 binder FLPSDYFPSV (HBV18-27; 
SEQ ID NO: 24) used as a positive control. An HLA-B44 binding peptide, AEMGKYSFY (SEQ 
ID NO: 25), was used as a negative control. The fluoresence obtained from the negative control 
was similar to the signal obtained when no peptide was used in the assay. Positive and negative 
control peptides were chosen from Table 18.3.1 in Curreni Protocols in Immunology p. 18.3.2, 
John Wiley and Sons, New York, 1998. 
7. hnmunogenicitv : 

A. In vivo imT miTii7i^tin n of mice . 

HHDl transgenic A*0201 mice (Pascolo, S., et al. J. Exp. Med, 185:2043-2051, 1997) 
were anesthetized and injected subcutaneously at the base of the tail, avoiding lateral tail veins, 
using 100 M-l containing 100 nmol of SSX-241^9 (SEQ ID NO. 15) and 20 |ig of HTL epitope 
peptide in PBS emulsified with 50 ^il of IFA (incomplete Freund's adjuvant). 

B. Preparation of stimulatin g cells (LPS blastsV 

Using spleens from 2 naive mice for each group of immunized mice, un-immunized mice 
were sacrificed and the carcasses were placed in alcohol. Using sterile instruments, the top dermal 
layer of skin on the mouse's left side (lower mid-section) was cut through, exposing the 
peritoneum. The peritoneum was saturated with alcohol, and the spleen was aseptically extracted. 
The spleen was placed in a petri dish with serum-free media. Splenocytes were isolated by using 
sterile plungers from 3 ml syringes to mash the spleens. Cells were collected in a 50 ml conical 
tubes in serum-frcc media, rinsing dish well. Cells were centrifiiged (12000 rpm, 7 min) and 
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washed one time with RPML Fresh spleen cells were resuspended to a concentration of 1x10^ 
cells per ml in RPMI-10%FCS (fetal calf serum). 25g/ml lipopolysaccharide and 7 \i2/ini Dextran 
Sulfate were added Cell wore incubated for 3 days in T-75 flasks at 37'*C, with 5% CO2. Splenic 
blasts were collected in 50 ml tubes pelleted (12000 rpm, 7 min) and resuspended to 3X10^^nil in 
5 RPMI. The blasts were pulsed widi the priming peptide at 50 ng/ml, RT 4hr. mitomycin C-treated 
at 25>ig/ml, 37^C, 20 min and washed three times with DMEM. 

C. In vitro stimulation . 

3 days after LPS stimulation of the blast cells and the same day as peptide loading, the 
primed mice were sacrificed (at 14 days post immunization) to remove spleens as above. 3x10^ 
10 splenocytes were co-cultured with 1x10^ LPS blasts/well in 24-weIl plates at 37''C, vnth 5% CO2 

in DMEM media supplemented with 10% PCS, 5x10-^ M p-mercaptoethanol, lOO^g/ml 
streptomycin and 100 lU/ml penicillin. Cultures were fed 5% (vol/vol) ConA supernatant on day 3 
and assayed for cytolytic activity on day 7 in a 5 ^Cr-release assay. 

D. Chromium-release assay measuring CTL activity . 

15 To assess peptide specific lysis, 2x10^ T2 cells were incubated with 100 ^Ci sodium 

chromate together with 50 jig/ml peptide at 37 C for 1 hour. During incubation th^ were 
gently shaken every 15 minutes. After labeling and loading, cells were washed three times with 10 
ml of DMEM- 10% PCS, wiping each tube with a fresh Kimwipe after pouring off the supernatant. 
Target cells were resuspended in DMEM- 10% FBS IxlOVml. Effector cells were . adjusted to 

20 ixlOVral in DMEM-10% PCS and 100 \i\ serial 3-fold dilutions of effectors were prepared in U- 

bottom 96-well plates. 100 ^1 of target cells were added per well. In order to determine 
spcHitaneous release and maximum release, six additional wells containing 100 \il of target ceUs 
were prepared for each target Spontaneous release was revealed by incubating the target cells with 
100 ^1 medium; maximum release was revealed by incubating the target cells with 100^1 of 2% 

25 SDS. Plates were then centrifuged for 5 min at 600 ipm and incubated for 4 hours at 37^C in 5% 
CO2 and 80% humidity. After the incubation, plates were then centrifuged for 5 min at 1200 rpm. 
Supematants were harvested and counted using a gamma counter. Specific lysis was determined as 
follows: % specific release = [(experimental release - spontaneous release)/(maximum release - 
spontaneous release)] x 100, 

30 Results of the chromium release assay demonstrating specific lysis of peptide pulsed target 

cells are shown in figure 6. 
8. Cross-reactivitv with other SSX proteins : 

SSX-24M9 (SEQ E> NO. 15) shares a high degree of sequence identity with the same region 
of the other SSX proteins. The surrounding regions have also been generally well conserved. Thus 

35 the housekeeping protcasome can cleave following V49 in all five sequences. Moreover, SSX41U19 is 
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predicted to bind HLA-A*0201 (see Table 6), CTL generated by immunization with SSX-24M9 
cross-react with tumor cells expressing other' SSX proteins. 



Table 6. SSX^hq - A*0201 Predicted Binding 



SEQIDNO. 


Famiiy 
Member 


Sequence 


SYFPErrm 

Score 


Nm 

Score 


15 


SSX.2 


KASEOFYV 


22 


1017 


26 


SSX-1 


KYSEKISYV 


18 


1.7 


27 


SSX.3 


KVSEKIVYV 


24 


1105 


28 


SSX-4 


KSSEKIVYV 


20 


82 


29 


SSX-5 


KASEKHYV 


22 


175 



Example 4 

Quster Analysis (PSMAi63.i92)- 

A peptide, AFSPQGMPEGDLVYVNYARTEDFFKLERDM, PSMAi63.,92, (SEQ ID NO. 
30), containing an Al epitope cluster from prostate specific membrane antigen, PSMA168.190 (SEQ 
ID NO. 31) was synthesized using standard solid-phase f-moc chemistry on a 433A ABI Peptide 
synthesizer. After side chain dcprotection and cleavage from the resin, peptide first dissolved in 
formic acid and then diluted into 30% Acetic acid, was run on a reverse-phase preparative HPLC 
C4 column at following conditions: linear AB gradient ( 5% B/mm) at a flow rate of 4 ml/min, 
where eluent A is 0.1% aqueous TFA and cluent B is 0.1% TFA in acetonitrile. A fraction at time 
16.642 min containing the expected peptide, as judged by mass spectrometry, was pooled and 
lyophilized. The peptide was then subjected to proteasome digestion and mass spectrum analysis 
essentially as described above. Prominent peaks from the mass spectra are summari25ed in Table 7. 
Table 7. PSMAi/i^uiq? Mass Peak Identification. 



PEPTIDE 


SEQUENCE 


CALCULATE 
DMASS (MH*) 


163-177 


AFSPQGMPEGDLVYV 


1610.0 


178-189 


NYARTEDFFKLE 


1533.68 


170-189 


PEGDLVYVlsrYARTEDFFKLE 


2406.66 


178-191 


NYARTEDPFKLERD 


1804.95 


170-191 


PEGDLVYVNYARTEDFFKLERD 


2677.93 


178-192 


NYARTEDFFKLERDM 


1936.17 


163-176 


AFSPQ6MPBGDLW 


1511.70 


177-192 


VNYARTEDPFKIiERDM 


2035,30 


163-179 


AFSPQGMPEGDLVZVNY 


1888.12 
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1 RO-1 09 


ARTEDFFKLERDM 


1658.89 


163-183 


AFSPQGMPEGDLVYVNYARTE 


2345.61 


184-192 


DFFKLERDM 


1201.40 


176-192 


YVNYARTEDFFKLERDM 


2198.48 


167-185 


QGMPEGDLVYVNYARTEDP 


2205.41 


178-186 


NYARTEDFF 


1163.22 



Boldface sequences correspond to peptides predicted to bind to MHC, see Table 8. 



N-terminal Pool Sequence Analysis 

One aliquot at one Hour of the proteasomal digestion (see Example 3 part 3 above) was 
subjected to N-terminal amino acid sequence analysis by an ABI 473A Protein Sequencer (AppUed 
Biosystems, Foster City, CA). Determination of the sites and efficiencies of cleavage was based on 
consideration of the sequence cycle, the repetitive yield of the protein sequencer, and the relative 
yields of amino acids unique in the analyzed sequence. That is if the unique (in the analyzed 
sequence) residue X appears only in the nth cycle a cleavage site exists n-1 residues before it in the 
N-terminal direction. In addition to helping resolve any ambiguity in the assignment of mass to 
sequences, these data also provide a more reliable indication of the relative yield of the various 
fragments than does mass spectrometry. 

For PSMAi63-i92 (SEQ ID NO. 30) this pool sequencing supports a single major cleavage 
site after V177 and several minor cleavage sites, particularly one after ¥179- Reviewing the results 
presented in figures 7 A-C reveals the following: 

S at the 3"^ cycle indicating presence of the N-tenninus of the substrate. 

Q at the 5* cycle indicating presence of the N-terminus of the substrate. 

N at the 1** cycle indicating cleavage after Vm- 

N at the 3"* cycle indicating cleavage after V175. Note the jfragment 176-192 in Table 7. 
T at the 5* cycle indicating cleavage after Vn?- 

T at the 1** -3"^ cycles, indicating increasingly common cleavages after Riai, Aigo and Y179. 

Only the last of these correspond to peaks detected by mass spectrometry; 163-179 and 

180-192, see Table 7. The absence of the others can indicate that they are on 

fragments smaller than were examined in the mass spectrum. 
K at the 4*, 8^ and 10* cycles indicating cleavages after E183, Y179, and V177, respectively, 
• all of which correspond to jfragments observed by mass spectroscopy. See Table 7. 
A at the l" and 3"* cycles indicating presence of the N-tenninus of the substrate and 

cleavage after V177, respectively. 
P at the 4**" and 8**" cycles indicating presence of the N-terminus of the substrate. 



-55- 



wo 2003/008537 



PCTAJS2002/010189 



G at the 6* and 10*^ cycles indicating presence of the N-terminus of the substrate. 
M at the 7* cycle indicating presence of the N-tenninus of Ac substrate and/or cleavage 
after Fiss* 

M at the 15* cycle indicating cleavage after V177. 
The 1** cycle can indicate cleavagp after D^i, see Table 7. 
R at the 4* and 13* cycle indicating cleavage after V177. 
R at the 2"* and 1 1* cycle indicating cleavage after Y179. 

V at die 2"^, 6\ and 13*^ cycle indicating cleavage after Vps, M169 and presence of the N- 

terminus of the subsliate, respectively. Note fragments beginning at 176 and 170 in 
Table 7. 

Y at the 1"^, 2^, and 14* cycles indicating cleavage after V175, V,77, and presence of the N- 

terminus of the substrate, respectively. 
L at thel 1* and 12* cycles indicating cleavage after Vn?, and presence of the N-terminus 
of the substrate, respectively, is the interpretation most consistent with the other data. 
Comparing to the mass spectrometry results we see that L at the 2"*, 5*, and 9* cycles 
is consistent with cleavage after F186, Ewa or M169, and Ym, respectively. See Table 7. 
Epitope Identification 

Fragments co-C-terminal with 8-10 amino acid long sequences predicted to bind HLA by 
the SYFPErrm or NTH algorithms were chosen for further analysis. The digestion and prediction 
steps of the procedure can be useftilly practiced in any order. Although the substrate peptide used 
in proteasomal digest described here was specifically designed to include a predicted HLA-Al 
binding sequence, the actual products of digestion can be checked after the fact for actual or 
predicted binding to other MHC molecules. Selected results are shown in Table 8. 
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Table 8. Predicted HLA binding by proteasomally generated fragments 



SEQ ID NO 


PEPTIDE 


HLA 


SYFPEITHI 


NIH 


32 & (33) 


(G) MPEGDLVY 
V 


A*0201 


17 (27) 


(2605) 






B*0702 


20 


<5 






B*5101 


22 


314 


34 & (35) 


(Q)GMPEGDLV 
Y 


Al 


24(26) 


<5 






A3 


16 (18) 


36 






B*2705 


17 


25 


36 


MPEGDLVY 


B*5101 


15 


NPt 


37 & (38) 


(P)EGDLVYVN 
Y 


Al 


27 (15) 


12 






A26 


23 (17) 


MP 


39 


LVYVNYARTE 


A3 


21 


<5 


40 & (41) 


(Y)VNYARTED 
F 


A26 


(20) 


NP 






B*08 


15 


<5 






B*2705 


12 


50 


42 


NYARTBDFF 


A24 


NPt 


100 






C>v*0401 


MP 


120 


43 


YARTEDFF 


B*08 


16 


<5 


44 


RTEDFFKLE 


Al 


21 


<5 






A26 


15 


NP 



'No prediction 



5 HLA-A*02D1 binding assay: 

HLA-A*0201 binding studies were prefoimed with PSMAieg-m, GMPEGDLVYV, (SEQ 
ID NO. 33) essentially as described in Example 3 above. As seen in figure 8, this epitope exhibits 
significant binding at even lower concentrations than the positive control p^tides. The Melan-A 
peptide used as a control in this assay (and throughout this disclosure), ELAGIGILTV, is actually a 
10 variant of the natural sequence (EAAGIGILTV) and exhibits a high affinity in this assay. 
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Example 5 

Cluster Analysis (PSMAjsi-aio)- 

Another peptide, RGIAEAVGLPSIPVHPIGYYDAQKLLEKMG, PSMA28,.3io, (SEQ ID 
NO. 45), containing an Al epitope cluster from prostate specific membrane antigen, PSMA283-307 
(SEQ ID NO. 46), was synthesized using standard solid-phase F-moc chemistry on a 433A ABI 
Peptide syntiiesizer. After side chain deprotection and cleavage from the resin, peptide in ddH20 
was run on a reverse-phase preparative HPLC C18 column at following conditions: linear AB 
gradient (5% B/min) at a flow rate of 4 ml/min, where eluent A is 0.1% aqueous TFA and eluent B 
is 0.1% TFA in acetonitrile. A fraction at time 17,061 min containing the expected peptide as 
judged by mass spectrometry, was pooled and lyophilized. The peptide was then subjected to 
proteasome digestion and mass spectrum analysis essentially as described above. Prominent peaks 
from the mass spectra are summarized in Table 9. 



Table 9. PSMA^«,-atn Mass Peak IdentificatioiL 



PEPTIDE 


SEQUENCE ■ 


CALCULATK 
DMASS(MH*) 


281-297 


R6IAEAVGLPSIPVHPI* 


1727.07 


286-297 


AVGLPSIPVHPI** 


1200.46 


287-297 


VGLPSIPVHPI 


1129.38 


288-297 


GLPSIPVHPI' 


1030.25 


298-310 


GYYDAQKLLEKMGt 


1516.5 


298-305 


GyYDAQKL§ 


958.05 


281-305 


RGI AEAVGLPS I PVHPIGYYDAQKL 


2666.12 


281-307 


RGIAEAVGLPSIPVHPIGYYDAQKLLE 


2908.39 


286-307 


AVGLPS I PVHP I GYYDAQKLLE H 


2381.78 


287-307 


VGIiPSI PVHPIGYYDAQKLLE 


2310.70 


288-307 


GLPSI PVHPIGYYDAQKLLE# 


2211.57 


281-299 


RGIAEAVGLPSIPVHPIGY 


1947 


286-299 


AVGLPSIPVHPIGY 


1420.69 


287-299 


VGLPSIPVHPIGY 


1349.61 


288-299 


GLPSIPVHPIGY 


1250.48 


287-310 


VGLPSI PVHPIGYYDAQKLLEKMG 


2627.14 


288-310 


GLPSIPVHPIGYYDAQKLLEKMG 


2528.01 



Boldface sequences correspond to peptides predicted to bind to MHC, see Table 10. 
♦By mass alone this peak could also have been 296-3 10 or 288-303. 

**By mass alone this peak could also have been 298-307, Combination of HPLC and mass 
spectrometry show that at some later time points this peak is a mixture of both species, 
f By mass alone this peak could also have been 289-298. 
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By mass alone this peak could also have been 281-295 or 294-306. 
§ By mass alone this peak could also have been 297-303. 
f By mass alone this peak could also have been 285-306. 
# By mass alone this peak could also have been 288-303. 

None of these alternate assignments are supported N-terminal pool sequence analysis. 

N-terminal Fool Sequence Analysis 

One aliquot at one hour of the proteasomal digestion (see Example 3 part 3 above) was 
subjected to N-tenninal amino acid sequaice analysis by an ABI 473A Protein Sequencer (AppHed 
Biosystems, Foster City, CA). Determination of the sites and efficiencies of cleavage was based on 
consideration of the sequence cycle, the repetitive yield of the protein sequencer, and the relative 
yields of amino acids unique in the analyzed sequence. That is if the unique (in the analyzed 
sequence) residue X appears only in the nth cycle a cleavage site exists n-1 residues before it in the 
N-terminal direction. In addition to helping resolve any ambiguity in the assignment of mass to 
sequences, these data also provide a more reliable indication of the relative yield of the various 
fragments than does mass spectrometry. 

For PSM A28i.3io( SEQ ID NO. 45) this pool sequencing supports two major cleavage sites 
after V287 and I297 among other minor cleavage sites. Reviewing the results presented in Fig. 9 
reveals the following: 

S at the 4* and 1 1* cycles indicating cleav^e after V287 and presence of the N-terminus of 

the substrate, respectively. 
H at the S'^ cycle indicating cleavage after V287. The lack of decay in peak height at 
positions 9 and 10 versus the drop in height present going from 10 to 11 can suggest 
cleavage after A286 and E28S as well, rather than the peaks representing latency in the 
sequencing reaction. 

D at the 2"*, 4*^, and 7* cycles indicating cleavages after Y299, 1297, and V294, respectively. 

This last cleavage is not observed in any of the fragments in Table 10 or in the 

alternate assignments in the notes below. 
Q at the 6* cycle indicating cleavage after I297. 

M at the 10* and 12* cycle indicating cleavages after ¥299 and I297, respectively. 
Epitope Identification 

Fragments co-C-terminal with 8-10 amino acid long sequences predicted to bind HLA by 
the SYFPEITHI or NIH algorithms were chosen for further study. The digestion and prediction 
steps of the procedure can be useftilly practiced in any order. Altiiough the substrate pq)tide used 
in proteasomal digest described here was specifically designed to include a predicted HLA-Al 
binding sequence, the actual products of digestion can be checked after the ftict for actual or 
predicted binding to other MHC molecules. Selected results are shown in Table 10. 
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Table 10. 

Predicted HLA bindiac bv proteasomallv generated fragments: PSMA«,,,,n 



SEQroNO. 


PEPTIDE 




SYFPEITHI 

lUf tLMrJL MUX JL.XJUU 


NIH 


47 & (48) 


(G) LPSIPVH 
PI 


A*0201 


16(24) 


(24) 


B*0702/B7 


23 


12 


B*5101 


• 24 


572 


Cw*0401 


NPt 


20 


49 & (50) 


(P) IGYYDAQ 
KL 


A*0201 


(16) 


<5 


A26 


(20) 


M» 


B*2705 


16 


25 


B*2709 


15 


NP 


B*510l 


O 1 

21 


j/ 


Cw*0301 


NP 


24 


51 & (52) 
53 


(P)SIPVHPI 
GY 

IPVHPIGY 


Al 


21 (27) 


<5 


A26 


22 


NP 


A3 


16 


<5 


B*5101 


16 


NP 


54 


YYDAQKLLE 


Al 


22 


<5 



tNo prediction 

5 

As seen in Table 10, N-terminal addition of authentic sequence to epitopes can often generate still 
useful, even better epitopes, for the same or different MHC restriction elements. Note for example 
the pairing of (G)LPSIPVHPI with HLA-A*0201, where the 10-mer can be used as a vaccine 
useful with several MHC types by relying on N-terminal trimming to create the epitopes for 
10 HLA-B7, -B*5101, and Cw*0401. 

HLA-A*0201 binding assay: 

HLA-A*0201 bmding studies were preformed with PSMA288-297, GLPSIPVHPI, (SEQ ID 
NO. 48) essentially as described in Examples 3 and 4 above. As seen in figure 8, this epitope 
exhibits significant binding at even lower concentrations than the positive control peptides. 
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Example 6 

Cluster Analysis (PSMA4S4-48i)- 

Anoflier peptide, SSIEGNYTI^VDCTPLMYSLVHLTKEL, PSMA4S4^i, (SEQ ED NO. 
55) containing an epitope cluster from prostate specific membrane antigen, was synthesized by 
MPS (purity >95%) and subjected to proteasome digestion and mass spectrum analysis as 
described above. Prominent peaks from the mass spectra are summarized in Table 1 1. 



Table 11. PSMA^^^st Mass Peak Identiflcation. 



MS PEAK 
(measured) 


PEPTIDE 


SEQUENCE 


CALCULATED 
MASS (MET) 


1238.5 


454-464 


SSIEGNVTIiRV 


1239.78 


1768.38i0.60 


454469 


SSIECaJYTLRVDCTPL 


1768.99 


1899.8 


454470 


SSI EGNYTLRVDCTPLM 


1900.19 


1097.63±0.91 


463-471 


RVDCTPLMy 


1098.32 


2062.87±0.68 


454-471* 


SSIEGNYTIiRVDCTPLMir 


2063.36 


1153 


472-481** 


SLVHNLTKEL 


115436 


1449.93il-79 


470^81 


MYSLVHNLTKEL 


1448.73 



Boldface sequence correspond to peptides predicted to bind to MHC, see Table 12. 

* On the basis of mass alone this peak could equally well be assigned to Hqq peptide 455-472 

however proteasomal removal of just the N-terminal amino acid is considered unlikely. If the issue 

were important it could be resolved by N-terminal sequencing. 

'I'^On the basis of mass this iragment might also represent 455-464. 



Epitope Identification 

Fragments co-C-terminal with 8-10 amino acid long sequences predicted to bind HLA by 
the SYFPEITHI or NIH algorithms were chosen for further study. The digestion and prediction 
steps of tiie procedure can be usefully practiced in any order. Although the substrate peptide used 
in proteasomal digest described here was specifically designed to mclude predicted HLA-A2.1 
binding sequences, the actual products of digestion can be checked after the &ct for actual or 
predicted binding to other MHC molecules. Selected results are shown in Table 12. 
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Table 12. Predicted HLA binding by proteasomally generated fragments 



SEOIDNO 


PEPUDE 


HLA 


SYFPEITHI 


NIH 


56 & (57) 


(S)IEGNYTLRV 


A 1 








EAxH X ± JJXv V 




lo (22 j 


<J 


lUl 


1 c 


JNrj 


59 & (60) 


(Y) TLRVDCTPL 


A ^lAO A 1 


'%f\ f\ Q\ 


(5) 




1A /1Q\ 

10 {^lo) 


XTD 

JNr 


D / 


14 




"DC 




<J 


D^Z /{JO 


1 0 


lA 


U 1 


IN It 




61 


LRVDCTPLM 








B*2709 


20 


NP 


62 & (63) 


(L)RVDCTPIiMY 


Al 


32 (22) 


125(13.5) 


A3 


25 


<5 


A26 


22 


NP 


B*2702 


NP 


(200) 


B*2705 


13 (NP) 


(1000) 



fNo prediction 



5 As seen in Table 12, N-terminal addition of authentic sequence to epitopes can often generate still 

useful, even better epitopes, for the same or different MHC restriction elements. Note for example 
the pairing of (L)RVDCTPLMY (SEQ ID NOS 62 and (63)) with HLA-B*2702/5, where the 10- 
mer has substantial predicted halftimes of dissociation and the co-C-terminal 9-mer does not. Also 
note the case of SBSGNYTLRV (SEQ ID NO 57) a predicted HLA-A*0201 epitope which can be 
10 used as a vaccine useful with HLA-B*5101 by relying on N-terminal trimming to create the 
epitope. 

HLA-A*0201 binding assay 

HLA-A*0201 binding studies were preformed, essentially as described in Example 3 
above, with PSMA46o^, TLRVDCTPL, (SEQ ID NO. 60). As seen in figure 10, this epitope was 
15 found to bind HLA-A2.1 to a similar extent as the known A2,l binder FLPSDYFPSV (HBVig.27; 

SEQ ID NO: 24) used as a positive control. Additionally, PSMA46i^69 , (SEQ ID NO. 59) binds 
nearly as well. 

ELISPOT analysis: PSMA^^(SEQ ID NO. 62) 
20 The wells of a nitrocellulose-backed microtiter plate were coated with capture antibody by 

incubating overnight at 4°C using 50 |il/well of 4^g/nil murine anti-human y-IFN monoclonal 
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ant'body in coating buffer (35 mM sodium bicarbonate, 15 mM sodium carbonate, pH 9.5). 
Unbound antibody was removed by washing 4 times 5 min. with PBS. Unbound sites on the 
membrane then were blocked by adding aOOixl/well of RPMI medium wifli 10% serum and 
incubating 1 hr. at room temperature. Antigen stimulated CDS'" T cells, in 1:3 serial dilutions, 
were seeded into flie wells of the microliter plate using lOOnI/well, starting at 2x10^ cells/well. 
(Prior antigen stimulation was essentially as described in Scheibenbogcn, C. et al. Int 1 Cancer 
71:932-936, 1997. PSMA462^7i (SEQ ID NO. 62) was added to a fmal concentration of lOng/ml 
and IL-2 to 100 U/ml and the cells cultured at 37°C in a 5% CO2, water-saturated atmosphere for 
40 hrs. Following this incubation the plates were washed with 6 times 200 |il/well of PBS 
containing 0.05% Tween-20 (PBS-Tween). Detection antibody, 50fil/well of 2g/ml biotinylated 
murine anti-human y-IFN monoclonal antibody in PBS+10% fetal calf serum, was added and the 
plate incubated at room temperature for 2 hrs. Unbound detection antibody was removed by 
washing with 4 times 200 fil of PBS-Tween. lOOiil of avidin-conjugated horseradish peroxidase 
(Pharmingen, San Diego, CA) was added to each well and incubated at room tenperature for I hr. 
lAibound enzyme was removed by washing with 6 times 200 ^1 of PBS-Tween. Substrate was 
prepared by dissolving a 20 mg tablet of 3-amino 9-ethylcoarbasolc in 2.5 ml of N, N- 
dimethylformamide and adding that solution to 47,5 ml of 0.05 M phosphate-citrate buffer (pH 
5.0). 25 ^1 of 3OV0 H2O2 was added to the substrate solution immediately before distributing 
substrate atlOO ^I/well and incubating the plate at room temperature. After color development 
(generally 15-30 min.), the reaction was stopped by washing the plate with water. The plate was 
air dried and the spots counted using a stcreomicroscope. 

Figure 11 shows the detection of PSMAiea-^?! (SEQ ID NO. 62)-reactive HLA-Al"^ CD8* T 
cells previously generated in cultures of HLA-Al* CD8* T cells with autologous dendritic cells 
plus the peptide. No reactivity is detected from cultures without peptide (data not shown). Li this 
case it can be seen that the peptide reactive T cells are present in the culture at a frequency between 
1 in 2.2x1 0^ and 1 in 6.7xl0^ That this is truly an HLA-Al-restricted response is demonstrated by 
the ability of anti-HLA-Al monoclonal antibody to block y-IFN production; see figure 12, 
Example 7 

Cluster Analysis (PSMA^ss^y). 

Another peptide, FDKSNPIVLRMMNDQLMFLERAFmPLGL^^ PSMA^3^7, 
(SEQ ID NO, 64) containing an A2 epitope cluster from prostate specific membrane antigen, 
PSMA«io.68i (SEQ ID NO 65), was synthesized by MPS (purity >95%) and subjected to proteasome 
digestion and mass spectrum analysis as described above. Prominent peaks from the mass spectra 
are summarized in Table 13, 
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Table 13. PSMA/jgww? Mass Peak Identification. 



MS PEAK 
(measured) 


PEPTIDE 


SEQUENCE . 


CALCULATED 
MASSOVIH^ 


906.17±0.65 


681-687** 


LPDRPPY 


908.05 


1287.73±0.76 


677-687** 


DPLGLPDRPFY 


1290,47 


1400.3±L79 


676-687 


IDPLGLPDRPFY 


1403.63 


1 </IC A4-1 in 




FIDPIjGLPDRPFY 


1550.80 






AF IDPLGLPDRPFY 


1621.88 


1775.48±1.32 


673-687* 


RAFIDPLGLPDRPFY 


1778.07 


2440.2±1.3 


653-672 


FDKSNPIVLRMMNDQLMFLE 


2442.93: 


1904.63±1. 56 


672-687* 


ERAF IDPLGLPDRPFY 


1907.19 


2310.6±2.5 


653-671 


FDKSNPIVLRMMNDQLMFL 


2313.82 


2017.4±1.94 


671-687 


LERAFIDPLGLPDRPFY 


2020.35 


2197.43±1.78 


653-670 


FDKSNPIVLRMHMaDQLMF 


2200.66 



Boldface sequence correspond to peptides predicted to bind to MHC, see Table 13, 

* On the basis of mass alone this peak could equally well be assigned to a peptide beginning at 
654, however proteasomal removal of just the N-terminal amino acid is considered unlikely. If the 
issue were important it could be resolved by N-terminal sequencing. 

** On the basis of mass alone these peaks could have been assigned to internal fragments, but 
given the overall pattern of digestion it was considered unlikely. 

Epitope Identification 

Fragments co-C-terminal with 8-10 amino acid long sequences predicted to bind HLA by 
the SYPPEITHI or NIH algorithms were chosen for further study. The digestion and prediction 
steps of the procedure can be usefully practiced in any order. Although the substrate peptide used 
in proteasomal digest described here was specifically designed to include predicted HLA-A2,1 
binding sequences, the actual products of digestion can be checked after the feet for actual or 
predicted binding to other MHC molecules. Selected results arc shown in Table 14. 
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Table 14. Predicted HLA binding by proteasomally generated fragments 



SEQIDNO 


PEPTIDE 


HLA 


SYFPEITHI 


NIH 


66&(67) 


(R>MMNDQLMF 
L 


A*0201 


24 (23) 


1360(722) 


A*0205 


NPt 


71 (42) 


A26 


15 


NP 


B*2705 


12 


50 


68 


RMMNDQLMF 


B*2705 


17 


75 



tNo prediction 



As seen in Tabic 14, N-terminal addition of authentic sequence to epitopes can generate 
still useful, even better epitopes, for the same or different MHC restriction elements. Note for 
example the pairing of (R)MMNDQLMFL (SEQ ID NOS, 66 and (67)) with HLA-A*02, where ttie 
10-mer retains substantial predicted binding potential. 

HLA-A*0201 binding assay 

HLA-A*0201 binding studies were preformed, essentially as described in Example 3 
above, with PSMA^dj^i, (SEQ ID NO. 66) and PSMA«2-67i, RMMNDQLMFL (SEQ NO. 67). As 
seen in figures 10, 13 and 14, this epitope exhibits significant binding at even lower concentrations 
than the positive control peptide (FLPSDYFPSV (HBV,g^7); SEQ ID NO: 24). Though not run in 
parallel, comparison to die controls suggests that PSMA€62-67i (which approaches the Melan A 
peptide in affmity) has the superior binding actiAaty of these two PSMA peptides. 
Example 8 

Vaccinating with epitope vaccines. 
1. Vaccination with peptide vaccines : 
A Intranodal delivery 

A forauilation containing peptide in aqueous buffer with an antimicrobial agent, an 
antioxidant, and an immunomodulating cytokine, was injected continuously over several days into 
the ingumal lymph node using a miniature pumping system developed for insulin delivery 
(MiniMed; Northridge, CA). This infusion cycle was selected in order to mimic the kinetics of 
antigen presentation during a natural infection. 

B. Controlled release 

A peptide formulation is delivered using controlled PLGA microspheres as is known in the 
art, which alter the pharmacokinetics of the peptide and improve immunogenicity. This 
formulation is injected or taken orally. 
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C. Gene gun delivery 

A peptide formulation is prepared wherein the peptide is adhered to gold microparticles as 
is known in the art. The particles are delivered in a gene gun, being accelerated at high speed so as 
to penetrate die skin, carrying tiic particles into .dermal tissues that contain pAPCs. 

D. Aerosol delivery 

A peptide formulation is inhaled as an aerosol as is known in the art, for uptake into 
appropriate vascular or lymphatic tissue in the lungs. 
2. Vaccination with nucleic acid vaccines : 

A nucleic acid vaccine is injected into a lymph node using a miniature pumping system, 
such as the MiniMed insulin pump. A nucleic acid construct formulated in an aqueous buffered 
solution containing an antimicrobial agent, an antioxidant, and an immunomodulating cytokme, is 
delivered over a several day infusion cycle in order to mimic the kinetics of antigen presentation 
during a natural infection. 

Optionally, the nucleic acid construct is delivered using controlled release substances, such 
as PLGA microspheres or other biodegradable substances. These substances are injected or taken 
orally. Nucleic acid vaccines are given using oral delivery, priming the immune response through 
uptake into GALT tissues. Alternatively, the nucleic acid vaccines are delivered using a gene gun, 
wherein the nucleic acid vaccine is adhered to minute gold particles. Nucleic acid constructs can 
also be inhaled as an aerosol, for uptake into appropriate vascular or lymphatic tissue in the lungs. - 
£xample 9 

Assays for the effectiveness of epitope vaccmes. 

1. Tetramer analvsis : 

Class I tetramer analysis is used to determine T cell frequency in an animal before and 
after administration of a housekeeping epitope. Clonal expansion of T cells in response to an 
epitope indicates that the epitope is presented to T cells by pAPCs. The specific T cell frequency 
is measured against the housekeeping epitope before and afler administration of the epitope to an 
animal, to determine if the epitope is present on pAPCs. An increase in frequency of T cells 
specific to the epitope after adminislration indicates that the epitope was presented on pAPC. 

2, Proliferation assay : 

Approximately 24 hours after vaccination of an animal with housekeeping epitope, pAPCs 
are harvested from PBMCs, splenocytes, or lymph node cells, using monoclonal antibodies against 
specific markers present on pAPCs, fixed to magnetic beads for affinity purification. Crude blood 
or splenoctye preparation is enriched for pAPCs using this technique. The enriched pAPCs are 
then used in a proliferation assay against a T cell clone that has been generated and is specific for 
the housekeeping epitope of interest. The pAPCs are coincubated with the T cell clone and the T 
cells are monitored for proliferation activity by measuring the incorporation of radiolabeled 
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thymidme by T cells. Proliferation indicates that T cells specific for the housekeeping epitope are 
being stimulated by that epitope on tiie pAPCs. 
3. Chromium release assay : 

A human patient, or non-human animal genetically engineered to express human class I 
5 MHC, is immunized using a housekeeping epitope. T cells from the immunized subject are used in 
a standard chromium release assay using hmnan tumor targets or targets engineered to express the 
same class I MHC. T cell killing of the targets indicates that stimulation of T cells in a patient 
would be effective at killing a tumor expressing a similar TuAA. 
Example 10 

10 Induction of CTL response with naked DNA is effideat by Intra-lymph node immunization* 

In order to quantitatively compare tiie CDS*^ CTL responses induced by different routes of 
immunization a plasmid DNA vaccine (p£GFPL33A) containing a well-characterized 
immunodominant CTL epitope from the LCMV-glycoprotein (G) (gp33; amino acids 33-41) 
(Oehen, S., et al.. Immunology 99, 163-169 2000) was used, as this system allows a comprehensive 

15 assessment of antiviral CTL responses. Groups of 2 C57BL/6 mice were immunized once with 

titrated doses (200-0.02 jug) of pEGFPLSSA DNA or of control plasmid pEGFP-N3, administered 
i.m, (intramuscular), i.d. (intradermal), i.spl. (intrasplenic), or i.ln. (intra-lymph node). Positive 
control mice received 500 pfu LCMV i,v. (intravenous). Ten days after immunization spleen cells 
were isolated and gp33-specific CTL activity was determined after secondaiy in vitro 

20 restimulation. As shown in Fig. 15, i.m. or i.d. immunization induced weakly detectable CTL 

responses when hi^ doses of pEFGPL33 A DNA (200^g) were administered. In contrast, potait 
gp33-specific CTL responses were elicited by immunization with only 2fig pEFGPL33A DNA 
i.spl. and with as little as 0J2^g pEFGPL33A DNA given Lhi. (figure 15; symbols represent 
individual mice and one of three similar experiments is shown). Immunization with the control 

25 p£GFP-N3 DNA did not elicit any detectable gp33-specific CTL responses (data not shown). 
Example 11 

Intra-lymph node DNA immunization elicits anti-tumor immunity. 

To examine whether the potent CTL responses elicited following i.ln. immunization were 
able to confer protection against peripheral tumors, groups of 6 C57BL/6mice were immunized 

30 three times at 6-day intervals with lO^ig of pEFGPL33A DNA or control pEGFP-N3 DNA. Five 

days after the last immunization small pieces of solid tumors e?q)ressing the gp33 epitope (EL4-33) 
were transplanted s.c. into both flanks and tumor growth was measured every 3-4d. Although &e 
EL4-33 tumors grew well in mice that had been repetitively immimized with control pEGFP-N3 
DNA (figure 16), mice which were immunized with pEFGPL33A DNA i.hi. rapidly eradicated the 

35 peripheral EL4-33 tumors (figure 16). 

-67- 



wo 2003/008537 



PCTAJS2002/010189 



Example 12 

Differences In lymph node DNA content mirrors differences in CTL response following intra- 
' lymph node and intramuscular injection. 

pEFGPL33 A DNA was injected i.ln. or im and plasmid content of the injected or draining 
5 lymph node was assessed by real time PGR after 6, 12, 24, 48 hours, and 4 and 30 days. At 6, 12, 
and 24 hours the plasraid DNA content of the injected lymph nodes was approximately three orders 
of magnitude greater than that of the draining lymph nodes following i.m. injection. No plasmid 
DNA was detectable in the draining lymph node at subsequent time points (Fig. 17). This is 
consonant with the three orders of magoitude greater dose needed using ijn. as compared to i.ln. 
10 injections to achieve a similar levels of CTL activity. CD8^" knockout mice, which do not develop 
a CTL re^onse to this epitope, were also injected i.hL showing clearance of DNA from the lymph 
node is not due to CDS* CTL killing of cells in the lymph node. This observation also supports the 
conclusion that i.hi. administration will not provoke immunopathologicai damage to the lymph 
node. 

15 Example 13 

Administration of a DNA plasmid formulation of a therapeutic vaccine for melanoma to 
humans. 

SYNCHROTOPE TA2M, a melanoma vaccine, encoding the HLA-A2-restricted tyrosinase 
epitope SEQ ID NO. 1 and epitope cluster SEQ ID NO. 69, was formulated in 1% Benzyl alcohol, 

20 1% ethyl alcohol, 0.5mM EDTA, citrate-phosphate, pH 7.6. Aliquots of 80, 160, and 320 jig 

DNA/ml were prepared for loading into MINIMED 407C infusion pun^js. The catheter of a 
SILHOUETTE infusion set was placed into an inguinal lymph node visualized by ultrasound 
im^ng. The assembly of pimq> and infusion set was originally designed for the delivery of insulin 
to diabetics and the usual 17mm catheter was substituted with a 31mm catheter for this application. 

25 The infusion set was kept patent for 4 days (approximately 96 hours) with an infusion rate of about 

25 jil/hour resulting in a total infused volume of approximately 2.4 ml. Thus the total administered 
dose per infusion was approximately 200, and 400 ng; and can be 800 ^g, respectively, for the 
three concentrations described above. Following an infusion subjects were given a 10 day rest 
period before starting a subsequent infusion. Given the continued residency of plasmid DNA in the 

30 lymph node after admmistration (as in example 12) and the usual kinetics of CTL response 
following disappearance of antigen, this schedule will be sufficient to maintain the immunologic 
CTL response. 
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Example 14 
Additional Epitopes. 

Ttie methodologies described above, and in particular in examples 3-7, have been applied 
to additional synthetic peptide substrates, leading to the identification of further epitopes as set for 
5 the in tables 15-36 below. The substrates used here were designed to identify products of 

housekeepmg proteasomal processing that give rise to HLA-A*0201 binding epitopes, but 
additional MHC-binding reactivities can be predicted, as discussed above. Many such reactivities 
are disclosed, however, these listings are meant to be exemplary, not exhaustive or limiting. As 
also discussed above, individual components of the analyses can be used in varying combinations 

10 and orders. The digests of the NY-ESO-1 substrates 136-163 and 150-177 (SEQ ID NOS. 254 and 

255, respectively) yielded fragments that did not fly well in MALDI-TOF mass spectrometry. 
However, they were quite amenable to N-terminal peptide pool sequencing, hereby allowing 
identification of cleavage sites. Not all of the substrates necessarily meet die formal definition of 
an epitope cluster as referenced in example 3. Some dusters are so large, e.g. NY-ESQ-lgMTb that 

15 it was more convenient to use substrates spanning only a portion of diis cluster. In other cases, 

substrates were extended beyond clusters meeting the formal definition to include neighboring 
predicted epitopes. In some instances, actual binding activity may have dictated what substrate was 
made, as with for example the MAGE epitopes reported here, where HLA binding activity was 
determined for a selection of peptides with predicted affinity, before synthetic substrates were 

20 designed. 
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SlLgmeUhood of Epitope Cross-reactivity onNon^arget Tissues. 

As noted above PSA is a menAer of the kallikrcin femily of proteases, which is itself a 
subset of the serine protease femily. While the members of this femily sharing the greatest degree 
of sequence identity ^vifc PSA also share similar expression profiles, it remains possible that 
individual epitope sequences might be shared with proteins having distinctly different expression 
profiles A first step in evaluating the likelihood of undesirable cross-reactivity is the identification 
of shared sequences. One way to accomplish this is to conduct a BLAST search of an epitope 
sequence against the SWISSPROT or Entrez non-redundant peptide sequence databases usmg the 
"Search for short nearly exact matches" option; hypertext transfer protocol accessible on the world 
wide web {http://www) at ••ncbi.ntauiih.gov/blastfindex.html". Tim searching SEQ ID NO. 214. 
WVLTAAHCI. against SWISSPROT Oimited to entries for homo sapiens) one finds four exact 
nuitches. including PSA. The other three are from kallikrein 1 (tissue kallikrein). and elastase 2A 
and 2B. While these nine amino acid segments are identical, the flanking sequences are quite 
distinct particularly on the C-termmal side, suggesting that processing may proceed differently and 
that thus flie same epitope may not be liberated from these other proteins. (Please note that 
kallikrein naming is confused. Tl^us the kallikrein 1 [accession number P06870] is a different 
protein than the one [accession nurriber AAD13817] mentioned in the paragraph on PSA above m 
the section on tumor-associated antigens). 

It is possible to test this possibility in several ways. Synthetic peptides containing the 
epitope sequence erribedded in fee context of each of these proteins can be subjected to in vitro 
proteasomal digestion and analysis as described above. Alternatively, cells expressing these other 
proteins, whether by natural or recombinant expression, can be used as targets in a cytotoxicity (or 
similar) assay using CD8*T cells lhat recognize the epitope, in order to determine if the epitope is 



Example 16 
Epitope Clusters. 

Known and predicted epitopes are generally not evenly distributed across the sequences of 
protein antigens. As referred to above, we have defined segments of sequence containing a higher 
than average density of (known or predicted) epitopes as epitope clusters. Among the uses of 
epitope clusters is the incorporation of their sequence into substrate peptides used in proteasomal 
digestion analysis as described herein. Epitope clusters can also be useful as vaccine components. 
A fidler discussion of the defmition and uses of epitope dusters is found in U.S. Patent Application 
No 09/561.571 entitled EPITOPE CLUSTERS, previously incorporated by reference. 

The foUowing tables (37-60) present 9-mer epitopes predicted for HLA-Al binding using 
both the SYFPErrai and NIH algorithms and the epitope density of regions of overlapping 
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epitopes, and of epitopes in the whole protein, and the ratio of these two densities. (The ratio must 
exceed one for there to be a cluster by the above definition; requiring higher values of this ratio 
reflect preferred embodiments). Individual 9-niers arc ranked by score and identified by the 
position of their first amino in the complete protein sequence. Each potential cluster &om a protein 
is numbered. The range of amino acid positions within the complete sequence that the cluster 
covers is indicated as are the rankings of the individual predicted epitopes it is made up of. 



Table 37 

BIMAS-NIH/Parker algorithmResultsfor gplOO 



Rank 






Rank 


Start 


Score 


1 






AX 


416 


19 


2 


OUz 




22 


25 


18 


3 


162 


226 


23 


566 


17 


4 


18 


118 


24 


603 


15 


5 


178 


118 


25 


384 


14 


6 


273 


117 


26 


13 


14 


7 


601 


81 


27 


290 


12 


8 


243 


63 


28 


637 


10 


9 


606 


60 


29 


639 


9 


10 


373 


50 


30 


485 


9 


11 


544 


36 


31 


453 


8 


12 


291 


29 


32 


102 


8 


13 


592 


29 


33 


399 


8 


14 


268 


29 


34 


456 


7 


15 


47 


27 


35 


113 


7 


16 


585 


26 


36 


622 


7 


17 


576 


21 


37 


69 


7 


18 


465 


21 


38 


604 


6 


19 


570 


20 


39 


350 


6 


20 


9 


19 


40 


583 


5 
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Table 38 

SYFPEITHI (Rammensee algorithm) Results for gplOO 



Rank 


Start 




Rank 


Start 


Score 


1 


606 


30 


J I 




20 


2 


162 


29 






20 


3 


456 


2o 


10 

Dy 




20 


4 


18 


oo 
lo 


Af\ 


610 


19 


5 


602 


2/ 


At 




19 


6 


598 


21 






19 


7 


601 


2o 




583 


19 


8 


597 


20 




570 


19 


9 


13 


2o 




488 


19 


10 


585 


25 


40 


446 


19 


11 


449 


25 


AT 


'^77 


19 


12 


4 


23 


/iQ 
4o 




19 


13 


603 


24 


AQ 




19 


14 


576 


24 


5U 




19 


15 


453 


24 


51 


iOV 


19 


16 


178 


24 


52 




19 


17 


171 


24 


53 


oo 


19 


18 


11 


'^A 

24 


54 


47 


19 


19 


619 


23 


55 


in 


19 


20 


280 


23 


56 


AIR 


18 


21 


268 


23 


57 


OvJ 


18 


22 


592 


22 


CO 

5o 


OV/*r 


18 


23 


544 


22 


5y 




18 


24 


465 


22 


00 


^71 


18 


25 


399 


22 


01 




18 


26 


373 


22 


62 


450 


18 


27 


273 


22 


63 


409 


18 


28 


243 


22 


64 


400 


18 


29 


566 


21 


65 


371 


18 


30 


563 


21 


66 


343 


18 


31 


485 


21 


67 


298 


18 


32 


384 


21 


68 


209 


18 


33 


350 


21 


69 


102 


18 


34 


9 


21 


70 


97 


18 


35 


463 


20 


71 


76 


18 


36 


397 


20 


72 


69 


18 



Rank 
73 



Start Score 



74 

75 

76 

77 

78 

79 

80 

81 

82 

83 

84 

85 

86 

87 

88 

89 

90 

91 

92 

93 

94 

95 

96 

97 

98 

99 

100 

101 

102 

103 

104 

105 

106 

107 

108 

109 



60 
17 
613 
599 
572 
557 
556 
512 
406 
324 
290 
101 
95 
635 
588 
584 
577 
559 
539 
494 
482 
468 
442 
413 
408 
402 
286 
234 
217 
211 
176 
107 
96 
80 
16 
14 
7 



18 

18 

17 

17 

17 

17 

17 

17 

17 

17 

17 

17 

17 

16 

16 

16 

16 

16 

16 

16 

16 

16 

16 

16 

16 

16 

16 

16 

16 

16 

16 

16 

16 

16 

16 

16 

16 
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Table 39 

Prediction of clusters for gp100 

Total AAs: 661 
Total 9-mers: 653 
SYFPEITHi 16: 109 9-mers 
NIH 5: 40 9-mers 













Cluster « 


AAs 


Epitopes (by Rank) 


Cluster 


Whole Pr 


Ratio 


SYFPEITHi 

^ III ^1 ill! 


1 


2tD26 


39.12J09. 34. 55.11.9. 
108 107 74 4 


0.440 


0.165 


2.668 




2 


69-115 


72.71.106, 53.85,105. 
70.84. 69.104 


A 140 

QJZio 


U.IDO 


1.290 




3 


95-115 


85. 105. 70. 84. 59 






1444 




4 


162-188 


2. 52. 17.103.16. 51 


A 000 




1.348 




5 


205-225 


50. 68. 102, 101 


n ion 


0 165 


1.155 




6 


243-258 


28.49 


C\^Of^ 
U. 1^9 


0.165 


0.758 




7 


267-306 


48, 21.38.27.20. 99. 83.37.67 




0 165 


1.364 




8 


322-332 


47,82 




0.165 


1.103 




9 


343-358 


66.33 




0.165 


0.758 




10 


371-381 


65,26 


0.182 


0165 


1.103 




11 


397-421 


35.25.64.98. 81.97.63.96 


0.320 


0.165 


1.941 




12 


442476 


95. 46. 11-. 62. 15. 3.35.24.94 


0.257 


0.165 






13 


482-502 


93.31.45.93 


0.190 


0.165 


1.155 




14 


539-552 


91.23 


0.143 


0.165 


0.866 




15 


556-627 


79, 78. 90, 30, 29, 61, 44. 60. 77. 14, 
89.43. 88.10.87.42.22,41.59. 8, 
6.76.7.5. 13,58.57.1.40.75,19 


0.431 


0.165 


2.611 


NIH 


1 


9to33 


20. 26.4.22 


0.160 


0.061 


2.644 




2 


268-281 


14,6 


0.143 


0.061 


2.361 




3 


290-299 


27.12 


0.200 


0.061 


3.305 




4* 


102-121 


32.35 


0.100 


0.061 


1653 




5* 


373-392 


10.25 


0.100 


0.061 


1653 




6 


453473 


31.34. 18 


0.143 


0.061 


2.361 




7 


566^00 


23. 19. 17.40.16,13 


0.171 


0.061 


2.833 




8 


601-614 


7. 2.24.38.9 


0.357 


0.061 


5.902 




9 


619-630 


1.36 


0.17 


0.061 


2.754 




10 


637-647 


28.29 


0.18 


0.061 


3.005 



*Neart)y but not overlapping epitopes 
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Table 40 

BIMAS-NIH/Parker algorithm Results for PSMA 



Rank 


dtaix 


Score 


1 


oo3 


1 JoU 


2 


711 


1 n<< 
IODj 


3 


4 


4oD 


4 


27 




5 


26 


3 /j 


6 


ooo 




7 


707 




8 


469 




9 


731 


in 


10 


35 


0/ 


11 


33 


o4 


12 


554 


ZC\ 

59 


13 


427 


50 


14 


115 


*f / 


15 


20 


/in 


16 


217 


ZO 


17 


583 


O/l 

z4 


18 


415 


1 Q 


19 


193 


14 


20 


240 


1 0 






11 


22 


260 


10 


23 


130 


10 


24 


741 


9 


25 


3 


9 


26 


733 


8 


27 


726 


7 


28 


286 


6 


29 


174 


5 


30 


700 


5 
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Table 41 

SYFPEITEa: (Rammensee algorithm) Results for PSMA 



Rank 


Start 


Score 


Rank 


Start 


Score 


Rank 


Start 


Score 


1 


469 


27 


31 


26 


20 


61 


305 


17 


2 


27 


27 


32 


3 


20 


62 


304 


17 


3 


741 


26 


33 


583 


19 


63 


286 


17 


4 


711 


26 


34 


579 


19 


64 


282 


17 


5 


354 


25 


35 


554 


19 


65 


169 


17 


6 


4 


25 


36 


550 


19 


66 


142 


17 


7 


663 


24 


37 


547 


19 


67 


122 


17 


8 


130 


24 


38 


390 


19 


68 


738 


16 


9 


57 


24 


39 


219 


19 


69 


634 


16 


10 


707 


23 


40 


193 


19 


70 


631 


16 


11 


260 


23 


41 


700 


18 


71 • 


515 


16 


12 


20 


23 


42 


472 


18 


72 


456 


16 


13 


603 


22 


43 


364 


18 


73 


440 


16 


14 


218 


22 


44 


317 


18 


74 


385 


16 


15 


109 


22 


45 


253 


18 


75 


373 


16 


16 


731 


21 


46 


91 


18 


76 


365 


16 


17 


668 


21 


47 


61 


18 


77 


361 


16 


18 


660 


21 


48 


13 


18 


78 


289 


16 


19 


507 


21 


49 


733 


17 


79 


278 


16 


20 


454 


21 


50 


673 


17 


80 


258 


16 


21 


427 


21 


51 


671 


17 


81 


247 


16 


22 


358 


21 


52 


642 


17 


82 


217 


16 


23 


284 


21 


53 


571 


17 


83 


107 


16 


24 


115 


21 


54 


492 


17 


84 


100 


16 


25 


33 


21 


55 


442 


17 


85 


75 


16 


26 


606 


20 


56 


441 


17 


86 


37 


16 


27 


568 


20 


57 


397 


17 


87 


30 


16 


28 


473 


20 


58 


391 


17 


88 


21 


16 


29 


461 


20 


59 


357 


17 








30 


200 


20 


60 


344 


17 
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Table 42 

Prediction of clusters for prostate^peciflc membrane antigen (PSMA) 

Total AAs: 750 
Total 9-mers: 742 
SYFPEITHI 16: 88 9-mers 
NIH 5:30 9-mer5 











Epitopes/AA 






A MM 


tpiiOp6S (uy ranK.; 


Cluster 


Whole Pr 


Ratio 


oYrrcl 1 ni 


1 


3to12 


32, 6 


A onn 


n 117 


l.r KTJ 




2 


1345 


HO OO 04 O C7 0J\ fifi 

13, 12, OO, ol, A o/, /o, 00 




n 117 

U. 1 1 f 


2.066 




3 


57-69 


y, 4/ 


n i«i4 


ft 117 

U. I I f 


1.311 




4 


100-138 


4e 0>l C? fl 

84, 83, 15, Z4, 0/. o 


U. 10^ 


n 117 


1 ?11 




5 


193-208 




0.111 


0.117 


0.947 




6 




ft9 14 '^Q 


0.273 


0.117 


2.324 




7 




R1 8fl 11 


0.182 


0.117 


1.550 




8 


278-297 


79.64. 23. 63.78 


0.250 


0.117 


2.131 




9 


354-381 


5. 59. 22,77. 43.76.76 


0.^0 


0.117 


2,131 




10 


385-405 


74, 38. 58,57 


0.190 


0.117 


1.623 




11 


440-450 


73, 56. 55 


0,273 


0.117 


2.324 




12 


454481 


20.72. 29,1,42,28 


0.214 


0.117 


1.826 




13 


507-523 


17.71 


0.118 


0.117 


1.003 




14 


547-562 


37. 36.35 


0.188 


0.117 


1.596 




15 


568-591 


27, 53. 34. 33 


0.167 


0.117 


1.420 




16 


603-614 


13.26 


0.167 


0.117 


1.420 




17 


631-650 


70. 69. 52 


0.150 


0,117 


1.278 




18 


660-681 


18.7,17. 51.50 


0.227 


0.117 


1.937 




19 


700719 


41,10.4 


0.150 


0.117 


1,278 




20 


731-749 


16. 49. 68,3 


0.211 


0.117 


1.794 


NIH 


1 


3tDl2 


25.3 


0.200 


0.040 


5.000 




2 


2043 


15, 5.4.11,10 


0.208 


0.040 


5.208 




3* 


415435 


18,13 


0.095 


0.040 


2.381 




4 


663-676 


1.6 


0.143 


0.040 


3.571 




5 


700-715 


30. 7. 3 


0.188 


0.040 


4.688 




6 


726-749 


27. 9. 26.24 


0.167 


0.040 


4.167 



*Nearby but not overlapping epitopes 
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Table 43 

BBMAS-NIH/Parker algorithm Results for PSA 



Rank 


Start 


Score 


1 


7 


607 


2 


170 


243 


3 


52 


124 


4 


53 


112 


5 


195 


101 


6 


165 


23 


7 


72 


18 


8 


245 


18 


9 


2 


16 


10 


59 


16 


11 


122 


15 


12 


125 


15 


13 


191 


13 


14 


9 


8 


15 


14 


6 


16 


175 


5 


17 


130 


5 
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Table 44 



Rank 


Start 


Score 


1 


72 


26 


2 


170 


22 


3 


53 


22 


4 


7 


22 


5 


234 


21 


6 


166 


21 


7 


140 


21 


8 


66 


21 


9 


241 


20 


10 


175 


20 


11 


12 


20 


12 


41 


19 


13 


20 


19 


14 


14 


19 


15 


130 


18 


16 


124 


18 


17 


121 


18 


18 


47 


18 


19 


17 


18 


20 


218 


17 


21 


133 


17 


22 


125 


17 


23 


122 


17 


24 


118 


17 


25 


110 


17 


26 


67 


17 


27 


52 


17 


28 


21 


17 


29 


16 


17 


30 


2 


17 


31 


184 


16 


32 


179 


16 


33 


158 


16 


34 


79 


16 


35 


73 


16 


36 


4 


16 
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Table 45 

Prediction of clusters for prostate specific antigen (PSA) 
Total AAs: 261 
Total 9-mers: 253 
SYFPEITH I 1 6: 36 9-mers 
NIH 5: 17 9-mers 

Epitopes/AA 



Cluster # AAs Epitopes (by rank) Cluster Whole Pr Ratio 



SYFPEITHI 


1 


2to29 


30.36. 4.11,14.29,19,13,28 


0.321 


0.138 


2.330 


2 


41-61 


12,18,27.3 


0.190 


0.138 


1.381 




3 


66-87 


8. 26, 1, 35. 34 


0527 


0.138 


1.648 




4 


110-148 


25.24.17.23. 16.22. 15.21.7 


0.184 


0.138 


1.332 




5 


158-192 


33.6.2.10.32.31 


0.171 


0.138 


1.243 




6 


234-249 


5.9 


0.125 


0.138 


0.906 




r 


118-133 


24.17.23. 16.22 


0.313 


0.138 


2.266 




8* 


118-138 


24,17,23.16,22.15 


0.286 


0.138 


2.071 


NiH 


1 


2-22 


9,1,14.15 


0.190 


0.065 


2.924 




2 


52-67 


3.4. 10 


0.188 


0.065 


2.879 




3 


122-138 


11.12. 17 


0.176 


0.065 


2.709 




4 


165-183 


6, 2. 16 


0.158 


0.065 


Z424 




5 


191-203 


13.5 


0.154 


0.065 


2362 




6** 


52-80 


3.4.10.7 


0.138 


0.065 


2.118 



These clusters are internal to the less preferred cluster #4. 



'Includes a nearby but not overlapping epitope. 
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BIMAS-NIH/Parker algorithm Results for PSCA 



Rank 


Start 


Score 


1 

X 


43 


m 


id 


5 


84 




7 


79 


4 


109 


36 


S 


105 


lS5 


6 


108 


24 


7 


14 


21 


8 


20 


18 


9 


115 


4? 


10 


42 




11 


36 


15 


12 


99 


9 


13 


58 


8 



SYFPEITHI (Rammensee algorithm) Results for PSCA 



Rank 


Start 


Score i 


Rank 


Start 


Score 


1 


108 


30 


17 


54 


19 


2 


14 


30 


18 


12 


19 


3 


105 


29 


19 


4 


19 


4 


5 


28 


20 


1 


19 


5 


115 


26 


21 


112 


18 


6 


99 


26 


22 


101 


18 


7 


7 


26 


23 


98 


18 


8 


109 


24 


24 


51 


18 


9 


53 


23 


25 


43 


18 


10 


107 


21 


26 


106 


17 


11 


20 


21 


27 


104 


17 


12 


8 


21 


28 


83 


17 


13 


13 


20 


29 


63 


17 


14 


102 


19 


30 


50 


17 


15 


60 


19 


31 


3 


17 


16 


57 


19 


32 


9 


16 






33 


92 


16 
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Table 48 

Prediction of clusters for prostate stem cell antigen (PSCA) 

Total Ms: 123 
Total 9-mers: 115 
SYFPEITHI 16:33; 
SYFPEITHl 20:13 
NIH 5:13 

















SYFPEITHI >16 


Cluster # 
1 
2 
3 


AAs 
1to26 
43-71 
92-123 


Epitopes (by rank) 

20,31. 19,4,7,12, 33.18.13,2.11 

25. 30.24,9,17,16.15.29 

32,23.6.27. 14.22,3.26,10, 

1.8.21.5 


Cluster 
0.393 
0.276 
0.406 


Whole Pr. 

0.268 
0.268 
0.268 


Ratio 
1.464 
1028 
1.514 


SYFPEITHI >20 


1 

2 


5to28 
99-123 


4,7,12,13.2.11 
6.3. 10.1.8.5 


0.250 
0.240 


0.106 
0.106 


2.365 
2.271 


NIH 


1 

2 
3 
4* 


5to28 
36-51 
99-123 
105-116 


2.3.7.8 

11.10.1 

12.5.6,4.9 

5.6.4 


0.167 
0.188 
0.200 
0.250 


0.106 
0.106 
0.106 
0.106 


1.577 
1.774 
1.892 
2.355 



This cluster is Internal to the less preferred cluster #3. 
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In tables 49-60 epitope prediction and cluster analysis data for each algorithm are 
presented together in a single table. 
Table 49 

Prediction of clusters for MAGE-1 (NIH algorithm) 

Total AAs: 309 
Total 9-nners: 301 
NIH 5:19 9-mers 



Cluster # 


AAs 


Epitope 


Start 


H\H 




Epitopes/AA 








Rank 


Position 


Score 


Cluster 


Whole Pr. 


Ratio 


1 


18-32 


16 


18 


9 


0.133 


0.063 


2.112 






19 


24 


7 








2 


101-113 


14 


101 


11 


0.154 


0.063 


2.442 






7 


105 


44 








3 


146-159 


9 


146 


32 


0.143 


0.063 


2.263 






3 


151 


169 








4 


169-202 


10 


169 


32 


0.176 


0.063 


2.796 






13 


174 


16 












18 


181 


8 












17 


187 


8 












6 


188 


74 












5 


194 


110 








5 


264-277 


2 


264 


190 


0.143 


0.063 


2.263 






12 


269 


20 








6 


278-290 


1 


278 


743 


0.154 


0.063 


2.437 






11 


282 


28 
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Table 50 

Prediction of clusters for MAGE-1 (SYFPEITHI algorithm) 



Total AAs:309 
Total 9-mers: 301 
SYFPEITHI 16: 46 9-mers 



Cluster # Aas 


Epitope 


Start 


SYFPEITHI 


Epitopes/AA 


Rank 


Position 


Score 


Cluster Whole Ratio 


1 7-49 


22 


7 


19 


0.233 0.153 1.522 




9 


15 


22 






27 


18 


18 






16 


20 


20 






28 


22 


18 






29 


24 


13 






33 


31 


17 






30 


35 


18 






2 


38 


26 






17 


41 


20 




2 89-132 


10 


89 


22 


0.273 0.153 1.783 




18 


92 


20 






7 


93 


23 






23 


96 


19 






43 


98 


16 






4 


101 


25 






8 


105 


23 






34 


107 


17 






35 


108 


17 






36 


113 


17 






37 


118 


17 






19 


124 


20 




3 167-203 


44 


167 


16 


0.270 0.153 1.766 




20 


169 


20 






12 


174 


21 






24 


181 


19 






6 


187 


24 






31 


188 


18 






25 


191 


19 






38 


192 


17 






1 


194 


27 






13 


195 


21 




4 230-246 


14 


230 


21 


0.118 0.153 0.769 




39 


238 


17 




5 264-297 


15 


264 


21 


0.235 0.153 1.538 




32 


269 


18 






40 


270 


17 






26 


271 


19 






46 


275 


16 






3 


278 


26 






21 


282 


20 






41 


289 


17 
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Table 51 

Prediction of clusters for WlAGE-2 (NIH algorithm) 



Total AAs: 314 
Total 9-mers: 308 
NIH >= 5: 20 9-mers 



Cluster # AAs 


Epitope 


Start 


NIH 




Epitope/AA 






Rank 


Position 


Score 


Cluster 


Whole Pr. 


Ratio 


1 101-120 


18 


101 


5.373 


0.150 


0.065 


2.310 




16 


108 


6.756 










1 


112 


2800.697 








2 153-167 


8 


153 


31.883 


0.200 


0.065 


3.080 




4 


158 


168.552 










7 


159 


32.138 








3 169-211 


14 


169 


8.535 


0.209 


0.065 


3.223 




19 


174 


5.346 










6 


176 


49.993 










11 


181 


15.701 










15 


188 


7.536 










12 


195 


12.809 










5 


200 


88.783 










10 


201 


16.725 










17 


203 


5.609 








4 271-284 


3 


271 


398.324 


0.143 


0.065 


2.200 




9 


276 


19.658 
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Table 52 

Prediction of clusters for MAGE-2 (SYFPEITHI algorithm) 

Total Ms: 314 
Total 9-mers: 308 



SYFPEITHI 1 6: 52 9-mers 



Cluster U 


AAs 


Enliooe 


Start 


SYFPEITHI 




Eoitooes/AA 


Katio 


Rank 


Position 


Score 


Cluster 


Whole Pr. 


1 


15-32 


13 


15 


21 


0.278 


0.169 


1,645 


29 


18 


18 












43 


20 


16 












30 


22 


18 












91 


n 


19 








2 


37-56 


31 


37 


18 


0.250 


0.169 


1.481 






16 


40 


20 












44 


44 


16 












14 


45 


21 












22 


48 


19 








3 


96-133 


36 


96 


17 


0.211 


0.169 


1.247 






46 


101 


16 












6 


108 


25 












47 


109 


16 












2 


112 


27 












37 


120 


17 












38 


125 


17 












17 


131 


20 








4 


153-216 


12 


153 


22 


0.344 


0.169 


Z036 






39 


158 


17 












7 


159 


25 












23 


161 


19 












24 


162 


19 












48 


164 


16 












49 


167 


16 












32 


170 


18 












50 


171 


16 












4 


174 


26 












9 


176 


24 












51 


177 


16 












15 


181 


21 












25 


188 


19 












18 


194 


20 












33 


195 


18 












19 


198 


20 












3 


200 


27 












1 


201 


28 












40 


202 


17 












10 


203 


23 












52 


9m 


16 








5 


237-254 


26 


237 


19 


0.167 


0.169 


0.987 






27 


245 


19 












34 


?46 


18 








6 


271-299 


8 


271 


25 


0.241 


0.169 


1.430 






35 


276 


18 












41 


277 


17 












11 


278 


23 












28 


283 


19 












20 


285 


20 












49 


m 


17 
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Table 53 

Prediction of clusters for MAGE-3 {NIH algorithm) 



Total AAs:314 
Total 9-mers: 308 
NIH 5: 22 9-mers 



Clufitor # 


AAs 


Eoitope 


Start 


NIH 




Epitopes/AA 


Ratio 




Rank 


Position 


Score 


Cluster 


Whole Pr. 


I 


lull 




101 


11.002 


0.200 


0,071 


2.800 






21 


105 


6.488 












8 


108 


49.134 












2 


112 


339.313 








2 


153-167 


18 


153 


7.776 


0.200 


0.071 


2.800 






6 


158 


51.77 












22 


159 


5.599 








3 


174-209 


17 


174 


8.832 


0.194 


0.071 


2.722 






7 


176 


49.993 












13 


181 


15.701 












19 


188 


7.536 












14 


195 


12.809 












5 


200 


88.783 












12 


201 


16.725 








4 


237-251 


16 


237 


10.868 


0.200 


0.071 


2.800 






4 


238 


148.896 












20 


243 


6.88 








5 


271-284 


1 


271 


2655.495 


0.143 


0.071 


2.000 






11 


276 


19.658 
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Table 54 

Prediction of clusters for MAGE-3 (SYFPEITHI algorithm) 

Total AAs: 314 
Total 9-mers: 308 
SYFPEITHI 16: 47 9-mers 



Cluster « AAs 



Eoitone 
■ Rank 



15-32 



12 
26 
37 
27 



Start 
Position 



SYFPEITHI 
Score — 



Cluster 



15 
18 
20 
22 

_2i. 



21 
18 
16 
18 

J2. 



0.278 



Eoitooes/AA 
Whole Pr. 



Ratio 



0.153 



1.820 



38-56 



38 
15 
39 
13 
19 



38 
40 
44 
45 
48 



16 
20 
16 
21 



0.263 0.153 1.725 



101-142 



28 
40 
1 
6 
31 
32 
16 



101 
105 
108 

112 
120 
125 
131 



18 
16 
31 
25 
17 
17 
20 



0.190 0.153 1.248 



4 


153-216 


20 


153 


19 


0313 


0.153 


2.048 




29 


156 


18 












33 


158 


17 












21 


159 


19 












34 


161 


17 












42 


164 


16 












43 


167 


16 












10 


174 


22 












8 


176 


23 












14 


181 


21 












22 


188 


19 












44 


193 


16 












11 


194 


22 












23 


195 


19 












45 


197 


16 












17 


198 


20 












3 


200 


27 












2 


201 


28 












35 


202 


17 












46 


208 


18 








5 


220-230 


5 


220 


26 


0.182 


0.153 


1.191 






47 


222 


16 








6 


237-246 


7 


237 


25 


0.200 


0.153 


1.311 






9 


238 


23 








7 


271-293 


4 


271 


27 


0.217 


0.153 


1.425 






30 


276 


18 












24 


278 


19- 












36 


283 


17 












25 


285 


19 
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Table 55 

Prediction of dusters for FRAME (NIH algorithm) 



Total AAs: 509 
Total 9-mers: 501 
NIH 5: 40 9-mers 



Pliictor it 


AAs 


Eoitope 


Start 


NIH 




Epitopes/AA 


Ratio 




Ranic 


Position 


Score 


Cluster 


Whole Pr. 


1 


33-47 


20 


33 


18 


0.133 


0.080 


1.670 




17 


39 


21 








d. 


r l"0 1 


g 


71 


50 


0.2 


0.07984 


2.605 






32 


73 


7 








o 
o 




23 


100 


15 


0.2 


0.07984 


2.505 






24 


99 


13 








A 




38 


126 


5 


0,2 


0.07984 


2.505 






35 


127 


6 








c 
0 




5 


224 


124 


0.130 


0.080 


1.634 






ft 
O 


230 


63 














C\J\J 


5 








0 




1ft 
1 o 


290 


18 


0.214 


0.080 


2.684 










23 












7 


295 


66 








"7 




9ft 




10 


0.200 


0.080 


2.505 






30 


308 


8 












25 


312 


13 












36 


316 


6 








8 


394-409 


2 


394 


182 


0.188 


0.080 


2.348 






12 


397 


42 












31 


401 


7 








9 


422-443 


10 


422 


49 


0.227 


0.080 


2.847 






3 


425 


182 












34 


431 


7 












29 


432 


9 












4 


435 


160 








10 


459-487 


15 


459 


21 


0.172 


0.080 


2.159 






11 


462 


45 












22 


466 


15 












40 


472 


5 












37 


479 


6 
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Table 56 

Prediction of clusters for FRAME (SYFPEITHI algorithm) 



Total AAs: 509 
Total 9-mers: 501 
SYFPEITHI 17: 80 9-mers 



Cluster # 


AAs 


Epitope 


Start 


SYFPEITHI 




cpiropss/MM 






Rank 


Position 


Score 


Cluster 


wnoie rr. 


Kaiio 


1 


18-59 


65 


18 


17 


0.238 


0.160 


1.491 






50 


21 


18 












66 


26 


17 












35 


33 


20 












22 


34 


22 












51 


37 


18 












5 


39 


27 












23 


40 


22 












13 


44 


24 












46 


51 


19 








2 


78-115 


36 


78 


20 


0.263 


0.160 


H CAO 

1.d4o 






67 


80 


17 












52 


84 


18 












24 


86 


22 












53 


91 


18 












25 


93 


22 












9 


99 


25 












8 


100 


26 












54 


103 


18 












55 


107 


18 








3 


191-202 


56 


191 


18 




U. lOU 








38 


194 


20 








4 


205-215 


26 


205 


22 


n 4 00 
U. lOZ 


U.lOU 


1,10*3 






27 


207 


22 








5 


222-238 


47 


222 


19 


0.235 


0.160 


•4 ATA 
1.4/4 






14 


224 


24 












69 


227 


17 












57 


230 


18 








6 


241-273 


70 


241 


17 


0.212 


0.160 


1.328 






15 


248 


24 












71 


255 


17 












30 


258 


21 












39 


259 


20 












58 


261 


18 












40 


265 


20 








7 


290-342 


72 


290 


17 


0.208 


0,160 


1.300 






48 


293 


19 












31 


298 


21 












73 


301 


17 












18 


305 


23 












6 


308 


27 












10 


312 


25 












19 


316 


23 












28 


319 


22 
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Prediction of clusters for FRAME (SYFPEITHI algorithm) 



Total AAs: 509 
Total 9-mers: 501 
SYFPEITHI 17: 80 9-mers 



Cluster # 


AAs 


Epitope 


Start 


SYFPEITHI 


Epitopes/AA 


Ratio 




Rank 


Position 


Score 


Cluster Whole Pr. 






41 


326 


20 










74 


334 


17 






Q 

0 


o4o-ooo 


oy 




18 


0238 0.160 


1,491 






ou 


348 


18 












351 


17 












353 

www 


23 










7fi 

/D 


355 

www 


17 








004-44/ 


4Q 


3fi4 

WU*T 


19 


0 250 0.160 


1.566 








771 

Of 1 


21 










il 


779 
Or £. 


55 

faW 












77*^ 
Of 0 


18 










77 


OOZ 


17 










Oi 
£A 


700 

09U 


97 

£w 










(0 


701 

0«7 1 


17 










1 


704 


70 










4z 


707 












oz 


403 

*tWw 


18 










00 


410 


21 










47 
*to 


418 


20 










34 


419 


21 










7 


422 


27 










2 


425 


29 










79 


426 


17 










63 


428 


18 










64 


431 


18 










12 


432 


25 










16 


43S 


24 










80 


439 


17 






10 


455-474 


29 


455 


22 


0.200 0.160 


1.253 






17 


459 


24 










4 


462 


28 










3 


466 


29 
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Predication of clusters for CEA (NIH algorithm) 



Total AAs:702 
Total 9-mers: 694 
NIH 5: 30 9-mers 



Cluster # 


AA 


Peptides 
Rank 


Start 
Position 


Score 


Cluster 


Peptides/AAs 
Whole Pr. 


Ratio 


1 


17-32 


5 
7 

20 


17 
18 

24 


46.873 
12.668 


V. too 


0.043 


4.388 


2 


113-129 


2 
15 


113 
121 


167.991 
21.362 


0.118 


0.043 


2.753 


3 


172-187 


25 
14 


172 
179 


9.165 
27.995 


0.125 


0.043 


2.925 


4 


278-291 


30 
17 


278 
283 


5.818 
19.301 


0.143 


0.043 


3.343 


5 


350-365 


9 
12 


350 
357 


43.075 
27.995 


0.125 


0.043 


2.925 


6 


528-543 


8 
13 


528 
535 


43.075 
27.995 


0.125 


0.043 


2.925 


7 


631-645 


23 
19 
24 


631 
634 
637 


9.563 
13.381 
9.245 


0.200 


0.043 


4.680 


8 


691-702 


1 

27 


691 
694 


196.407 
7.769 


0.167 


0.043 


3.900 
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Predication of clusters for CEA (SYFPHTHI algorithm) 

Total AAs:702 
Total 9-mers: 694 
SYFPEITHI 16:81 9-mers 



Cluster # 


AA Peptides 




Score 




Peptfdes/AAs 


Ratio 




Rank 






Cluster 


Whole Pr. 


1 


5-36 


67 


O 


16 


0.250 


0.117 


2.140 




23 


19 


19 

i 9 












24 


ID 


1Q 












9 


17 
1 f 


22 












25 


lO 


19 












32 


19 


18 












68 




16 












33 


28 


18 








2 


37-62 


41 


^^7 


17 


0.269 


0.117 


2,305 






20 


44 


20 












26 


45 


19 












42 




17 












27 


50 


19 












43 


53 


17 












44 


54 


17 








<j 


99-115 


14 


99 


21 


0.235 


0.117 


2.014 






5 


100 


23 












45 


104 


17 












34 


107 


18 








A 


116-129 


69 


116 


16 


0.143 


0.117 


1.223 






21 


121 


20 








0 


172-187 


46 


172 


17 


0,125 


0.117 


1.070 






70 


179 


16 










192-202 


3 


192 


24 


0.182 


0.117 


1.557 






47 


194 


17 








( 


226-241 


48 


226 


17 


0.188 


0.117 


1.605 






49 


229 


17 












15 


233 


21 








8 


307-318 


11 


307 


22 


0.250 


0,117 


2.140 






71 


308 


16 












51 


310 


17 








9 


319-349 


52 


319 


17 


0.129 


0.117 


1.105 






53 


327 


17 












72 


335 


16 












35 


341 


18 








10 


370-388 


12 


370 


22 


0.211 


0.117 


1.802 






54 


372 


17 












74 


375 


16 












6 


380 


23 








11 


403-419 


56 


403 


17 


0.235 


0.117 


2.014 






57 


404 


17 












58 


407 


17 












28 


411 


19 
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12 


427-442 


59 


427 


17 


0.188 


0.117 


1.605 






75 


432 


16 












76 


434 


16 








13 


450-462 


77 


450 


16 


0.154 


0.117 


1,317 






13 


454 


22 








14 


488-505 


36 


488 


18 


0.167 


0.117 


1.427 






18 


492 


21 












60 


497 


17 








15 


548-558 


4 


548 


24 


0.182 


0.117 


1.557 




61 


550 


17 








16 


565-577 


62 


565 


17 


0.154 


0.117 


1.317 






19 


569 


21 








17 


579-597 


78 


579 


16 


0.143 


0.117 


1.223 






79 


582 


16 












7 


589 


23 








18 


605-618 


2 


605 


25 


0.143 


0.117 


1.223 






38 


610 


18 








19 


631-669 


29 


631 


19 


0.154 


0.117 


1.317 






63 


637 


17 












80 


644 


16 












64 


652 


17 












39 


660 


18 












81 


661 


16 








20 


675-702 


22 


675 


20 


0.286 


0.117 


O AACt 






30 


683 


19 












31 


687 


19 . 












40 


688 


18 












65 


690 


17 












1 


691 


31 












66 


692 


17 












8 


694 


23 
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Table 59 

Predication of clusters for SCP-1 (NIH algorithm) 



Total AAs: 976 
Total 9-mers: 968 
NIH 5: 37 9-mers 



Cluster # 


AA 


Peptides 
Rank 


Start 
Position 


Score 


uiusier 


Peptides/AAs 
Whole Pr 


Ratio 


1 


101-116 


15 
13 


101 

4 AO 


40-589 


0.125 


0.038 


3.270 


2* 


281-305 


14 
24 
17 


281 
288 
297 


44.944 
15,203 
32.857 


0.12 


0.038 


3.139 


3 


431-447 


8 

26 
4 


431 
438 
439 


80.217 
11.861 
148.896 


0.073 


0.038 


1.914 


4 


557-579 


11 
19 
6 
18 


557 
560 

564 
571 


64.335 
24,937 

87.586 
32.765 


0.174 


0.038 


4.550 


5 


635-650 


10 
34 


635 
642 


69.552 
6.542 


0.125 


0.038 


3.270 


6 


755-767 


36 

35 


755 

759 


5.599 

5.928 


0.154 


0.038 


4.025 


7 


838-854 


2 
28 


838 
846 


284.517 
11.426 


0.118 


0.038 


3.078 
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Table 60 

Predication of clusters for SCP-1 



Total AAs: 976 
Total 9Hners: 968 
Rammensee 16:118 9-mers 



Cluster # 


AA 


Peptides 


Start 


Score 




Peptides/AAs 








Rank 


Position 




Cluster 


Whole Pr. 


Ratio 


1 


8-28 


99 


8 


16 


0.143 


0.121 


1.182 






77 


15 


17 












100 


20 


16 








2 


63-80 


78 


63 


17 


0.222 


0.121 


1.838 






50 


66 


19 












102 


69 


16 












60 


72 


18 








3 


94-123 


79 


94 


17 


0.133 


0.121 


1.103 






12 


101 


23 












17 


108 


22 












103 


115 


16 








4 


126-158 


35 


126 


20 


0.182 


0.121 


1.504 






36 


133 


20 












51 


139 


19 












80 


140 


17 












61 


143 


18 












37 


150 


20 








5 


161-189 


38 


161 


20 


0.207 


0.121 


1.711 






52 


165 


19 












81 


171 


17 












82 


177 


17 












62 


178 


18 












39 


181 


20 








6 


213-230 


40 


213 


20 


0.167 


0.121 


1.379 






13 


220 


23 












28 


222 


21 








7 


235-250 


63 


235 


18 


0.125 


0.121 


1.034 






18 


242 


22 








8 


260-296 


83 


260 


17 


0.243 


0.121 


2.012 






105 


262 


16 












84 


267 


17 












106 


269 


16 












41 


270 


20 












64 


271 


18 












85 


274 


17 












19 


281 


22 












3 


288 


25 








9 


312-338 


108 


312 


16 


0.148 


0.121 


1.225 






29 


319 


21 












30 


323 


21 












65 


330 


18 








10 


339-355 


66 


339 


18 


0.235 


0.121 


1.946 






31 


340 


21 












42 


344 


20 












53 


347 


19 
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11 


376-447 


54 


Of O 


1Q 
19 


n 1QA. 


0 191 


1 608 






43 


382 


on 












44 


386 














20 


390 


oo 












55 


Oft"7 

397 


19 












6 


404 


OA 












86 


407 


1 f 












45 


>ii 1 
411 














67 


41 / 


1 A 
JO 












21 


4^0 


£^ 












46 


431 


on 












68 


432 


io 












32 


4oo 














7 


439 


Z4 








12 


455-488 


33 


455 


o<i 
21 


A OOR 


f\ 191 
U.1Z 1 








4/ 




or\ 

£\J 












56 


462 


19 












DT 
O/ 


400 


17 
1 f 












oo 


*rOD 


17 












14 


ATH 
*f/U 


oo 












109 


4/0 


1A 
ID 












34 


480 


OA 

C\ 








13 


51 5-530 


57 


olo 


1Q 


U. 1 zo 


n 191 


1.034 






22 


522 


22 








14 


557-590 


8 


557 


O/i 

ii4 


A 1A7 


n 191 


1 216 

1 1 w 






23 


564 


oo 












ft 
9 


0/1 


OA 

^4 














O/ D 


17 












CO 

OO 




1Q 








15 


610-625 


69 


610 


•1 Q 

18 


A 1 O*^ 
U.I ZO 


n 191 

U. IZ 1 


1 034 






91 


617 


1 f 








16 


633-668 


92 


633 


17 


/-V OOO 

0.222 










10 


635 


Oil 

24 












70 


638 


18 












9o 


o4U 


17 
I f 












4o 




OA 














OHO 


90 

^vl 












ill 
1 1 1 




16 












119 


\J\J\J 


16 

1 \j 








«! "7 
1 I 


D ( 4-DOO 


71 
f 1 


D/4 


1 ft 
1 o 


n 1fi7 

VJ. 1 Of 


0.121 


1.379 






1 1 


Of f 


9A 








18 


687-702 


1 


687 


o^ 


n oc 


A 1 01 


1 

1 .UO*f 






94 


694 


1 / 








19 


744-767 


113 


744 


16 


0.250 


A lOI 
U.1Z1 


9 Aftft 

z.uoo 






95 


745 


1 7 
1 f 












4 


745 


OK 












24 


752 


22 












2 


755 


26 












72 


759 


18 








20 


812-827 


97 


812 


17 


0.125 


0.121 


1.034 






115 


819 


16 








21 


838-857 


116 


838 


16 


0.150 


0.121 


1.241 






25 


846 


22 












74 


849 


18 
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22 


896-913 


117 


896 


16 


0.222 


0.121 


1.838 






98 


899 


17 












26 


902 


22 












76 


905 


18 









1 MLLAVLYCLL WSFQTSAGHF PRACVSSKNL MEKECCPPWS GDRSPCGQLS 
GRGSCQNILL 

5 61 SNAPLGPQFP FTGVDDRESW PSVFYNRTCQ CSGNFMGFNC GNCKFGFWGP 

NCTERRLLVR 

121 RNIFDLSAPE KDKFFAYLTL AKHTISSDYV IPIGTYGQMK NGSTPMFNDI 
NIYDLFVWMH 

181 YYVSMDALLG GSEIWRDIDF AHEAPAFLPW HRLFLLRWEQ EIQKLTGDEN 
10 FTIPYWDWRD 

241 AEKCDICTDE YMGGQHPTNP NLLSPASFFS SWQIVCSRLE EYNSHQSLCN 
GTPEGPLRRM 

301 PGNHDKSRTP RLPSSADVEP CLSIiTQYESG SMDKAANFSF RNTLEGFASP 
LTGIADASQS 

15 361 SMHKALHIYM NGTMSQVQGS ANDPIFLLHH AFVDSIFEQW LRRHRPLQEV 

YPEANAPIGH 

421 NRESYMVPFI PIiYRNGDFFI SSKDLGYDYS YLQDSDPDSP QDYIKSYLEQ 

ASRIWSWLLG 

481 AAMVGAVLTA LLAGLVSLLC RHKRKQLPEE KQPLLMEKED YHSLYQSHL 



20 



TYROSINASE PROTEIN 



25 



30 1 MNGDDAFARR PTVGAQXPEK IQKAFDDIAK YFSKEEWEKM KASEKIFYVY 

MKRKYEAMTK 

61 LGFKATLPPF MCNKRAEDFQ GNDLDNDPNR GNQVERPQMT FGRLQGISPK 
IMPKKPAEEG 

121 NDSEEVPEAS GPONDGKELC PPGKPTTSEK IHERSGPKR6 EHAWTHRLRE 
35 RKQLVIYEEI 

181 SDPEEDDE 

SSX-2 PROTEIN 

40 
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1 MWNLLHETDS AVATARRPKW LCAGALVLAG GFFLLGFLFG WFIKSSNEAT 

MXTPKHNMKA 

61 PLDELKAENI KKPLyUFTQI PHIAGTEQNF QIAKQIQSQW KBFGLDSVEL 

^^^'^^la?' NKTHPNYISI INEDGNEIFN TSLFEPPPPG YBNVSDIVPP PSAFSPQGMP 

EGDLVYVNYA ^^^^^^ DMKINCSGKI VIARYGKVPR GNKVKNAQIA GAKGVILYSD 

'^^^^^^ SYPDGWNLPG GGVQRGIIILN LNGAGDPIiTP GYPANEYAYR RGIAEAVGLP 
S I PVHP IGYY 

301 DAQKLLEKMG GSAPPDSSWR GSLKVPYNVG PGFTGNFSTQ KVKMHIHSTN 
EVTRIYNVIG 

361 TLRGAVEPDR YVILGGHRDS WVFGGIDPQS GAAWHEIVR SFGTLKKEGW 
R PRRT I IjFAS 

421 WDAEEFGLIiG STEWAEENSR LLQERGVAYI NADSSIEGNY TLRVDCTPLM 
YSLVHNLTKE 

4 81 LKSPDEGFEG KSLYESWTKK SPSPEFSGMP RISKLGSGND FEVFFQRLGI 
AS GRAR YTIQT 

541 WETNKFSGYP L.YHSVYETYE LVEKFYDPMF KYHLTVAQVR GGMVFEIANS 

I VLPFDCRDY 

601 AWLRKYADK lYSISMKHPQ EMKTYSVSFD SLFSAVKNFT EIASKFSERL 

^^^^^661^ LRMMNDQI^ LERAFIDPLG LPDRPFYRHV lYAPSSHNKY AGESFPGIYD 
ALFDIESKVD 

721 PSKAWGEVKR QIYVAAFTVQ AAAETLSEVA 

PSMA PROTEIN 



1: Hoiao sapiens tyrosinase PubMed. Pfotein, 

NM 00037 (oculocutaneous albinism lA) (TYR) , Taxonomy, OMIM, LinkOut 
2 MrnA 

LOCUS NM_000372 1964 bp mRNA PRI 31- 

OCT-2000 . 
DEFINITION Homo sapiens tyrosinase (oculocutaneous albinism lA) 

(TYR) , mRNA. 
ACCESSION NM_000372 
VERSION NM_000372.1 GI:4507752 

KEYWORDS 

SOURCE human . 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 

Euteleostomi; , . . „ . 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; 

Homo. 

REFERENCE 1 (bases 1 to 1964) 

AUTHORS Kwon BS, Haq AK, Pomerantz SH and Halaban R. 
TITLE Isolation and sequence of a cDNA clone for human 
tyrosinase that 

maps at the mouse c -albino locus 
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84 (21), 7473-7477 
(1987) 

MEDLINE 88041128 
PUBMED 2823263 

REMARK Erratum: [[published erratum appears in Proc Natl Acad 
Sci U S A 
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1988 Sep;85{17) :6352]] 
REFERENCE 2 (bases 1 to 1964) 

AUTHORS Barton DE, Kwon BS and Francke U. , 
TITLE Hiaitian tyrosinase gene, mapped to chromosome 11 (ql4--- 

"^^'^^ ' defines second region of homology with mouse 

chromosome 7 

JOURNAL Genomics 3 (1), 17-24 (1988) 
89122007 
3146546 

3 (bases 181 to 1964) 

Shibahara,S. , Tomita.Y., Tagami,H. , Muller,R.M- and 



MEDLINE 
PUBMED 
REFERENCE 

AUTHORS 
Cohen, T. 

TITLE 
tyrosinase 

JOURNAL 

MEDLINE 
REFERENCE 

AUTHORS 
AN. 

TITLE 



Molecular basis for the heterogeneity of human 

Tohoku J. Exp. Med. 156 (4), 403-414 (1988) 
89222868 

4 (bases 1 to 1964) 

Bouchard B, Fuller BB, Vijayasaradhi S and Houghton 



Induction of pigmentation in mouse fibroblasts by 

expression of 

human tyrosinase cDNA 
JOURNAL J. Exp. Med. 169 (6), 2029-2042 (1989) 
MEDLINE 89279151 
PUBMED 2499655 
REFERENCE 5 (bases 1 to 1964) 

AUTHORS Takeda,A., Tomita^Y. , Okinaga^S., Tagami^H. and 

Shibahara , S . ^ . 

TITLE Functional analysis of the cDNA encoding human 
tyrosinase precursor 



JOURNAL 

MEDLINE 
REFERENCE 

AUTHORS 
Watanabe M. 

TITLE 
human 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 



Biochem. Biophys. Res. Comnaun. 162 (3), 984-990 (1989) 
89351001 

6 (bases 1 to 1964) 

Kikuchi H, Miura H, Yamamoto H, Takeuchi T, Dex T and 
Characteristic sequences in the upstream region of the 

tyrosinase gene 

Biochim. Biophys. Acta 1009 (3), 283-286 (1989) 
90089403 
2480811 



7 (bases 1 to 1964) 
Giebel LB, Strunk KM and Spritz RA. 
Organization and nucleotide sequences of the human 

tyrosinase gene 

and a tanincated tyrosinase -related segment 
Genomics 9 (3), 435-445 (1991) 

91236163 



JOURNAL 
MEDLINE 
PUBMED 

REFERENCE 
AUTHORS 

Lethe B, 

TITLE 
autologous 

JOURNAL 



1903356 

8 (bases 1 to 1964) 

Brichard V, Van Pel A, Wolfel T, Wolfel C, De Plaen E, 
Coulie P and Boon T- 

The tyrosinase gene codes for an antigen recognized by 

cytolytic T lymphocytes on HLA-A2 melanomas 
J. Exp. Med, 178 (2), 489-495 (1993) 
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MEDLINE 93340625 
PUBMED 8340755 
COMMENT PROVISIONAL REF5EQ: This record has not yet been 

subject to final 



M27160.1 . 
FEATURES 

soxirce 



mRNA" 



gene 



CDS 



lA)" 



NCBI review. The reference sequence was derived from 

Location/Qualifiers 
1..1964 

/organism* "Homo sapiens" 
/db_xr e f = ^ t axon : 9 6 0 6 " 
/chromosomes 1 1 
/map*«llql4-q21'' 

/cell_line=*^MeWo melanoma DNA, and cDNA to 

/tissue_type= "placenta" 
1. .1964 
/gene=*TYR" 
/note=^^OCAIA" 
/db_xre f = Locus ID : 7299 
/db_xref» '*MIM : 203100 " 
83. .1672 
/gene= ^TYR" 

/EC_number='*l . 14 . 18 . 1" 
/note= ""Tyrosinase" 
/ codon_s t ar t = 1 
/db__xref ="LocusID:7299* 
/db_xref="*MIM : 203100" 

/product='*tyrosinase (oculocutaneous albinism 

/protein_id= "" NP 000363.1 " 
/db_xref="GI:4507753" 



/translation="MLLAVLyCLLWSFQTSAGHFPRACVSSKNLMEKECCPPWSGDRS 

PCGQLSGRGSCQNILLSNAPLGPQFPFTGVDDRESWPSVFYNRTCQCSGNFMGFNCGN 

CKFGFWGPNCTERRLLVRRNIFDLSAPEKDKFFAYLTLAKHTISSDyVXPIGTYGQMK 

NGSTPMFNDINIYDLFVWMHYYVSMDALLGGSEIWRDIDFAHEAPAFLPWHRLFLLRW 

EQEIQKLTGDENFTIPYWDWRDAEKCDICTDEYMGGQHPTNPNLLSPASFFSSWQIVC 

SRLEEYNSHQSLCNGTPEGPLRRNPGNHDKSRTPRLPSSADVEFCLSLTQYESGSMDK 

AANFSFRNTLEGFASPLTGIADASQSSMHNALHIYMNGTMSQVQGSANDPIFLLHHAF 

VDSIFEQWLRRHRPLQEVYPEANAPIGHNRESYMVPFIPLYRNGDFFISSKDLGYDYS 

YLQDSDPDSFQDYIKSYLEQASRIWSWLLGAAMVGAVLTALLAGLVSLLCRHKRKQLP 

EEKQPLLMEKEDYHSLYQSHL" 
siq peptide 83. .136 

mat peptide 137.. 1669 

/note« " tyros inase" 

/product* "tyrosinase (oculocutaneous albinism 



lA)' 



misc feature 



560.. 1429 

/note= "tyrosinase; Region: Common central 



domain of 
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tyrosinase' 
BASE C0T3NT 520 a 462 c 

ORIGIN 

1 atcactgtag tagtagctgg 
5 ttcctgcaga 

61 ccttgtgagg actagaggaa 
tgtggagttt 

121 ccagacctcc gctggccatt 
tgatggagaa 
10 181 ggaatgctgt ccaccgtgga 

caggcagagg 

241 ttcctgtcag aatatccttc 
ccttcacagg 

301 ggtggatgac cgggagtcgt 
15 agtgctctgg 

361 caacttcatg ggattcaact 
caaactgcac 

421 agagagacga ctcttggtga 
agaaggacaa 
20 481 attttttgcc tacctcactt 

tcafcccccat 

541 agggacctat ggccaaatga 
tcaatattta 

601 tgacctcttt gtctggatgc 
25 ggggatctga 

661 aatctggaga gacattgatt 
ggcatagact 

721 cttcttgttg cggtgggaac 
acfctcactat 
30 781 tccatattgg gactggcggg 

agtacatggg 

841 aggtcagcac cccacaaatc 
cctcttggca 

901 gattgtctgt agccgattgg 
35 atggaacgcc 

961 cgagggacct ttacggcgta 
caaggctccc 

1021 ctcttcagct gatgtagaat 
gttccatgga 
40 1081 taaagctgcc aatttcagct 

cacttactgg 

1141 gatagcggat gcctctcaaa 
tgaatggaac 

1201 aatgtcccag gtacagggat 
45 atgcatttgt 

1261 tgacagtatt tttgagcagt 
tttatccaga 

1321 agccaatgca cccattggac 
taccactgta 
SO 1381 cagaaatggt gatttcttta 

gctatctaca 

1441 agattcagac ccagactctt 
aagcgagtcg 

1501 gatctggtca tggctccttg 
55 ccctgctggc 

1561 agggcttgtg agcttgctgt 
aaaagcagcc 



PCT/US2002/010189 



44^ CI 


537 t 




dctcty ay cLcLct u 


w L>y \«y ci^i«^^ 


CLCIL> L>cLy Lt^Ciy 


gaauycuccb 


ggccgcuuty 


I" a ^ ^*r/^ ^ f" n 
uy^Ciby ^ 


4" M ^ ^^rs 

uccccagagc 


^ 4~ ^^t" I" ^ ^ ^ 

cugcgt>cucw 


i- CUaa.y aacc 


gcgggga c ag 


gagcccwuy t 


ygccaycc ut- 


^ ^ ^ ^ ^ ^ 
uguccaacgc 


accac c ^yy y 






^4. f. A. 1. _ A. _ 


ay y uy 


i~ /Trt ^ a o <^ t~ ^ 

y cyyaaacu.g 


C3.ag U LL ^gg C 


L, c L uygggac 


yaagd,a.a.CoC 


ct. ucga I- ucy 


&gL.gcccca,g 


uagca.d.i3.gca 


H o a ^ a i^i^ 

CaCCaU Ccly C 


♦~ /^^/^ a ^ 9 +* 

u Ca.gaC C al.g 


a.aa.auggacc 


aaCa C C (J a. uy 


4* ^ 4* 9 9 ^^^9 9 

C U U aaCgaCa 


a u u a L. L.a ug t. 


guCaauygaC 


^^/■•9 ^^<^^4" 4" 

y Cacngcuug 


u t gc c ca ego. 


agCaC C ciy CJ c 




aagaaau c ca 


gaagc cgaca 


J 9 4* 9 9 

ggagacgaaa 


a t gc agaaaa 


^4*<^4~«^4 ^ ^ 

gcgugacai^L. 


cgcacagaug 


^^^^^ a 
cuaacL-uacu 


cagcccagca 


uCauCC UCCC 


aggagcacaa 


cage ca u. cag 


^ ^»4* 4" 4* 9 4" ^Hr^ 9 

ucuucaugca 


accccggaaa 


ccatgacaaa 


uccagaaccc 


c ucyccugay 


4" ^ ^ #^ ^ ^ 

cu ugacccaa 


L.ClL>y CLcLUv^ L.y 


ttagaaatac 


actggaagga 


+• +• 4* /^<^4* 9 /"t H ^ 

u utgcta.gt.c 


gcagcatgcd 


caatgcctfcg 




ctgccaacgd 


uccuaccccc 


^ 4* ^ 4~ 4~ ^9 


ggc u ccy aag 


ycaccy tccu 


^ ^ ^ 9 9 ^r9 9 

c^ucaayaay 


ataaccggga 


atcctacatg 


gttcctttta 


tttcatccaa 


agatctgggc 


tatgactata 


ttcaagacta 


cattaagtcc 


tatttggaac 


gggcggcgat 


ggfcaggggcc 


gtcctcactg 


gtcgtcacaa 


gagaaagcag 


cttcctgaag 



-150- 



wo 2003/008537 



PCTAJS2002/010189 



L621 actcctcatg gagaaagagg attaccacag cttgtatcag agccatttat 
aaaaggctta 

L681 ggcaatagag tagggccaaa aagcctgacc tcactctaac tcaaagtaat 
gtccaggttc 

5 1741 ccagagaata tctgctggta tttttctgta aagaccattt gcaaaattgt 

aacctaatac 

1801 aaagtgtagc cttcttccaa ctcaggtaga acacacctgt ctttgtcttg 
ctgttttcac 

1861 tcagcccttt taacattttc ccctaagccc atatgtctaa ggaaaggatg 
10 ctatttggta 

1921 atgaggaact gttatttgta tgtgaattaa agtgctctta tttt 



1: Homo sapiens synovial sarccaaa, PubMed. Protein, Related 

NM 0031 X breakpoint 2 (SSX2), mRNA Sequences, Taxonomy, OMIM, 

7"= LinkOut 

47 

15 LOCUS NM_003147 766 bp inRNA PRI 14- 

MAR-2001 

DEFINITION Homo sapiens synovial sarcoma, X breakpoint 2 (SSX2) , 
mRNA. 

ACCESSION NM_003147 
20 VERSION NM_003147.1 GI: 10337582 

KEYWORDS 

SOURCE human . 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 

25 Euteleostomi 

Mammalia; Eutheria; Primates; Catarrbini; Hominidae; 

Homo. 

REFERENCE 1 (bases 1 to 766) 

AUTHORS Shipley M, Clark J, Crew AJ, Birdsall S, Rocques PJ, 
30 Gill S, 

Chelly J, Monaco AP, Abe S, Gusteraon BA and et al. 
TITLE The t (X;18) <pll.2;qll.2) translocation found in human 
synovial 

sarcomas involves two distinct loci on the X 

35 chromosome 

JOURNAL Oncogene 9 (5), 1447-1453 (1994) 
MEDLINE 94203675 
PUBMED 8152806 
REFERENCE 2 (bases 1 to 766) 
40 AUTHORS Crew, A. J., Clark, J., Fisher, C-, Gill,S., Grimer,R- , 

Chand,A. , 

Shipley, J., Gusterson,B. A. and Cooper, C.S. 
TITLE Fusion of SYT to two genes, SSXl and SSX2, encoding 
proteins with 

45 homology to the Kruppel -associated box in human 

synovial sarcoma 

JOURNAL EMBO J, 14 (10), 2333-2340 (1995) 
MEDLINE 95292974 
REFERENCE 3 (bases 1 tO 766) 
50 AUTHORS Tureci 0, Sahin U, Schobert I, Koslowski M, Scmitt H, 

Schild HJ, 

Stenner F, Seitz G, Rammensee KG and Pfreundschuh M, 
TITLE The SSX-2 gene, which is involved in the t{X;18) 
translocation of 
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gene 



synovial sarcomas, codes for the human tumor antigen 

HOM-MEL-40 

JOURNAL Cancer Res, 56 (20), 4766-4772 (1996) 
MEDLINE 96438636 
5 PUBMED 8840996 

COMMENT PROVISIONAL REFSEQ: This record has not yet been 
subject to final 

NCBI review. The reference secjuence was derived from 

X86175.1 . 

10 FEATURES Location/Qualif iers 

source 1 . .766 

/organisms *Homo sapiens* 
/db_xref="taxon:9606'' 
/chromosome= "X" 
15 /map= "Xpll . 23 -pll .22" 

/de v_s tage= adul t " 
1, .766 

/gene="SSX2'' 
/note=*"HD21; HOM-MEL-40; SSX" 
20 /db_xref = *^LocusID : 6757 * 

/db xref=^MIM: 300192 ^^ 
mi so feature 20,. 61 

/note=^^Kruppel aaociated box homology" 
CDS 92 . . 658 

25 /gene=^^SSX2" 

/codon_start=l 
/db_xref = '^LocusID: 6757 ^^ 
/db xref="'MIM; 3 00192 " 

/product ='*synovial sarcoma, X breakpoint 2" 
30 /protein_id= ^^ NP 003138.1 ** 

/db_xref =«GI : 10337583" 

/ trans lat ion= ^mGDDAFARRPTVGAQIPEKIQKAFDDIAKYFSKEEWEKMKASE 

35 KI FYVYMKRKYEAMTKLGFKATLPPFMOTKRAEDFQGNDLDNDPNRGNQVERPQMTF 

RLQGISPKIMPKKPAEEGNDSEEVPEASGPQNDGKELCPPGKPTTSEKXHERSGPKRG 

EHAWTHRLRERKQLVIYEEISDPESDDE" 
misc feature 161.. 337 
40 /note=~KRAB; Region: krueppel associated box* 

BASE COUNT 229 a 181 c 200 g 156 t 

ORIGIN 

1 ctctctttcg attcttccat actcagagta cgcacggtct gattttctct 
ttggattctt 

45 61 ccaaaatcag agtcagactg ctcccggtgc catgaacgga gacgacgcct 

ttgcaaggag 

121 acccacggtt ggtgctcaaa taccagagaa gatccaaaag gccttcgatg 
atattgccaa 

181 atacttctct aaggaagagt gggaaaagat gaaagcctcg gagaaaatct 
50 tctatgtgta 

241 tatgaagaga aagtatgagg ctatgactaa actaggtttc aaggccaccc 
tcccaccttt 

301 catgtgtaat aaacgggccg aagacttcca ggggaatgat ttggataatg 
accctaaccg 

55 361 tgggaatcag gttgaacgtc ctcagatgac tttcggcagg ctccagggaa 

tctccccgaa 

421 gatcatgccc aagaagccag cagaggaagg aaatgattcg gaggaagtgc 
cagaagcatc 
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4S1 tggcccacaa aatgatggga aagagctgtg ccccccggga aaaccaacta 
cctctgagaa 

541 gattcacgag agatctggac ccaaaagggg ggaacatgcc tggacccaca 
gactgcgtga 

5 601 gagaaaacag ctggtgattt atgaagagat cagcgaccct gaggaagatg 

acgagtaact 

€61 cccctcaggg atacgacaca tgcccatgat gacaagcaga acgtggtgac 
ctttcacgaa 

721 catgggcatg gctgcggacc cctcgtcatc aggtgcatag caagtg 

10 

1: Homo sapiens folate hydrolase PubMed/ Protein, Related 

MM 004 (prostate- specific meiribraiie antigen) Sequences, Taxonomy ^ OMIM. 
476 1 (FOLHl)/ mRNA LinkOut 
LOCUS NM_004476 2653 bp inRNA. PRI 01- 

NOV-2000 

15 DEFINITION Homo sapiens folate hydrolase (prostate-specific 
iTiembrane antigen) 

1 (FOLHl) , raRNA. 
ACCESSION NM_0 04476 
VERSION NM_004476.1 GI:4758397 

20 KEYWORDS 

SOURCE human . 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Cramiata; Vertebrata; 
Euteleostomi ; 

25 Mammalia; Eutheria; Primates; Catarrhini; Hominidae; 

Homo. 

REFERENCE 1 (bases 1 to 2653) 

AUTHORS Israeli, R.S., Powell, Fair,W,R. and Heston,W.D. 
TITLE Molecular cloning of a complementary DNA encoding a 
30 prostate- specific membrane antigen 

JOURNAL Cancer Res. 53 (2), 227-230 (1993) 
MEDLINE 93113576 
REFERENCE 2 (bases 1 to 2653) 

AUTHORS Rinker-Schaeffer CW, Hawkins AL, Su SL, Israeli RS, 
35 Griffin CA, 

Isaacs JT and Heston WD. 
TITLE Localization and physical mapping of the prostate- 
specific membrane 

antigen (PSM) gene to human chromosome 11 
40 JOURNAL Genomics 30 (1), 105-108 (1995) 

MEDLINE 96129312 
PUBMED 8595888 
REFERENCE 3 (bases 1 to 2653) 

AUTHORS O'Keefe DS, Su SL, Bacich DJ, Horiguchi Y, Luo Y, 
45 Powell CT, 

zandvliet D, Russell PJ, Molloy PL, Nowak NJ, Shows 
TB, Mullins C, 

Vender Haar RA, Fair WR and Heston WD, 
TITLE Mapping, genomic organization and promoter analysis of 
50 the human 

prostate- specific membrane antigen gene 
JOURNAL Biochim. Biophys. Acta 1443 (1-2), 113-127 (1998) 
MEDLINE 99057588 
PUBMED 9838072 
55 REFERENCE 4 (bases 1 to 2653) 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



AUTHORS 
Markham AF . 

TITLE 
specific 

JOURNAL 
MEDLINE 
PUBMED 
COMMENT 



Maraj BH, Leek JP, Karayi All M, Lench NJ and 

Detailed genetic mapping around a putative prostate - 

meTTibrane antigen locus on human chromosome llpll.2 
Cytogenet. Cell Genet. 81 (1), 3-9 (1998) 
98358137 
9691167 

PROVISIONAL REFSEQ: This record has not yet been 

subject to final 

NCBI review. The reference sequence was derived from 

M99487.1 . 

FEATURES Location/Qualifiers 
source 1..2653 

/organisms *Homo sapiens" 
/db_xref ="taxon: 9606" 
/chromosomes"!!" 
/map=^llpll.2'' 
/sexs^^male" 

/ce 1 1__1 ine = "LNCa P - ATCC" 
/cell_type= ^*pros tate" 

/tissue_type=*prostatic carcinoma metastatic 

/tissue_lib=*LNCaP cDNA of Ron Israeli" 
1. .2653 
/gene=^^FOLHl" 
/note=*FOLH; PSM; PSMA" 
/db xref=^LocusID: 2346 ^ . 
/db_xref=*MIM : 600934 * 
262, .2514 
/gene=^^FOLHl" 

/note=''folate hydrolase 1 (prostate-specific 

antigen) " 
/codon_start=l 
/db_xref =*LocusID : 2346 
/db_xref=^MIM : 600934 
/evidence=experimental 

/product=*folate hydrolase (prostate-specific 

membrane 

antigen) 1" 

/protein_id= " NP 004467.1 * 
/db_xref=^GI: 4758398" 

/translation^^MWMLLHETDSAVATARRPRWLCAGALVLAGGFFLLGFLFGWFIK 

SSNEATNITPKHNMKAFLDELKAENIKKFLYNFTQIPHLAGTEQNFQLAKQIQSQWKE 

FGLDSVELAHYDVLLSYPNKTHPNYISIINEDGNEIFNTSLFEPPPPGYENVSDIVPP 

FSAFSPQGMPEGDLVYVNYARTEDFFKLERDMKINCSGKIVIARYGKVFRGNKVKNAQ 

LAGAKGVlLYSDPADYFAPGVKSYPDGWNLPGGGVQRGNIXiNLNGAGDPLTPGYPANE 

YAYRRGIAEAVGLPSIPVHPIGYYDAQKLLEKMGGSAPPDSSWRGSIiKVPYNVGPGFT 

GNFSTQKVKMHIHSTNEVTRIYNVIGTLRGAVEPDRYVILGGHRDSWVFGGIDPQSGA 



lymph node" 
gene 

CDS 
membrane 
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AVVHEIWSFGTLKKEGWRPiyiTILFASWDAEEPGLLGSTEWAEElSrSRIiLQERGVAYI 
NADSSIEGNYTLRVDCTPLMYSLVHNLTKELKSPDEGFEGKSLYESWTKKSPS 

5 

MPRISKLGSGlsroFEWFQRLGIASGRJ^YTKNWETNKFSGYPLYHSVYETyELV^ 

DPMFKYHLTVAQVRGGMVFELANSIVLPPDOT)YAVVLRKm)iaYS 

10 YSVSFDSLFSAVKNFTEIASKFSERLQDFDKSNPIVLRMMNDQLMFLERAFIDPLGLP 

DRPFYRHVIYAPSSHNKYAGESFPGIYDALFDIESKVDPSKAWGEVKRQIYVAAFTVQ 

AAAETLSEVA" 
misc feature 778.. 1029 
15 /note='*PA; Region: PA domain" 

BASE COUNT 782 a 524 c 640 g 707 t 

ORIGIN 

1 ctcaaaaggg gccggatttc cttctcctgg aggcagatgt tgcctctctc 

tctcgctcgg 

20 61 attggttcag tgcactctag aaacactgct gtggtggaga aactggaccc 

caggtctgga 

121 gcgaattcca gcctgcaggg ctgataagcg aggcattagt gagattgaga 

gagactttac 

181 cccgccgtgg tggttggagg gcgcgcagta gagcagcagc acaggcgcgg 
25 gtcccgggag 

241 gccggctctg ctcgcgccga gatgtggaat ctccttcacg aaaccgactc 
ggctgtggcc 

301 accgcgcgcc gcccgcgctg gctgtgcgct ggggcgctgg tgctggcggg 
tggcttcttt 

30 361 ctcctcggct tcctcttcgg gtggtttata aaatcctcca atgaagctac 

taacattact 

421 ccaaagcata atatgaaagc atttttggat gaattgaaag ctgagaacat 
caagaagttc 

481 ttatataatt ttacacagat accacattta gcaggaacag aacaaaactt 
35 tcagcttgca 

541 aagcaaattc aatcccagtg gaaagaattt ggcctggatt ctgttgagct 
agcacattat 

601 gatgtcctgt tgtcctaccc aaataagact catcccaact acatctcaat 
aattaatgaa 

40 661 gatggaaatg agattttcaa cacatcatta tttgaaccac ctcctccagg 

atatgaaaat 

721 gtttcggata ttgtaccacc tttcagtgct ttctctcctc aaggaatgcc 
agagggcgat 

781 ctagtgtatg ttaactatgc acgaactgaa gacttcttta aattggaacg 
45 ggacatgaaa 

841 atcaattgct ctgggaaaat tgtaattgcc agatatggga aagttttcag 
aggaaataag 

901 gttaaaaatg cccagctggc aggggccaaa ggagtcattc tctactccga 
ccctgctgac 

50 961 tactttgctc ctggggtgaa gtcctatcca gatggttgga atcttcctgg 

aggtggtgtc 

1021 cagcgtggaa atatcctaaa tctgaatggt gcaggagacc ctctcacacc 
aggttaccca 

1081 gcaaatgaat atgcttatag gcgtggaatt gcagaggctg ttggtcttcc 
55 aagtattcct 

1141 gttcatccaa ttggatacta tgatgcacag aagctcctag aaaaaatggg 
tggctcagca 
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1201 ccaccagata gcagctggag aggaagtctc aaagtgccct acaatgttgg 
acctggcttt ^^^^^^^^^^ tttctacaca aaaagtcaag atgcacatcc actctaccaa 

5 ^^^^^i32?agaatttaca atgtgatagg tactctcaga ggagcagtgg aaccagacag 

^^^^^Sei^ctgggaggtc accgggactc atgggtgttt ggtggtattg accctcagag 
tggagcagct^^^^^^^^^ aaattgtgag gagctttgga acactgaaaa aggaagggtg 

10 ^^^^^isSf^agaacaattt tgtttgcaag ctgggatgca gaagaatttg gtcttcttgg 

^^^^^^^isll^tgggcagagg agaattcaag actccttcaa gagcgtggcg tggcttatat 

15 ^^^^^isll^tcatctatag aaggaaacta cactctgaga gttgattgta caccgctgat 

gtacagcttg ^ . . , 

1681 gtacacaacc taacaaaaga gctgaaaagc cctgatgaag gctttgaagg 

caaatctctt 

1741 tatgaaagtt ggactaaaaa aagtccttcc ccagagttca gtggcatgcc 
20 ""^^^^itoraaattgggat ctggaaatga ttttgaggtg ttcttccaac gacttggaat 

^^^^'^^isll^gagcacggt atactaaaaa ttgggaaaca aacaaattca gcggctatcc 

25 ^^^^*^i921^agtgtctatg aaacatatga gttggtggaa aagttttatg atccaatgtt 

taaatatcac 4--s^«^«^*.t.^ 
1981 ctcactgtgg cccaggttcg aggagggatg gtgtttgagc tagccaattc 

''^^^^204l''ccttttgatt gtcgagatta tgctgtagtt ttaagaaagt atgctgacaa 

30 aatctacagt ^ . ♦.^*.«^*.t.f-«a 

2101 atttctatga aacatccaca ggaaatgaag acatacagtg tatcactcga 

ttcacttttt 

2161 tctgcagtaa agaatjtttac agaaattgct tccaagttca gtgagagact 

35 ^^^^^222rgacaaaagca acccaatagt attaagaatg atgaatgatc aactcatgtt 

*^^*^^^228l\catttattg atccattagg gttaccagac aggccttttt ataggcatgt 
catctatgct 

2341 ccaagcagcc acaacaagta tgcaggggag tcattcccag gaatttatga 

40 tgctctgttt 

2401 gatattgaaa gcaaagtgga cccttccaag gcctggggag aagtgaagag 

^^^^^2461^gttgcagcct tcacagtgca ggcagctgca gagactttga gtgaagtagc 
ctaaqaggat 

45 2521 tctttagaga atccgtattg aatttgtgtg gtatgtcact cagaaagaat 

^^^^^258?atattgataa attttaaaat tggtatattt gaaataaagt tgaatattat 
atataaaaaa 

2641 aaaaaaaaaa aaa 
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U20093. Human melanocyte-. . . [gi: 1142634] 

LOCUS HSPMEL17S2 2817 bp DNA PRI 20- 

NOV-1996 

DEFINITION Human melanocyte- specific (pmel 17) gene, exons 2-5/ 
5 and complete 

cds . 

ACCESSION U20 093 
VERSION U20093.1 01:1142634 

KEYWORDS 
10 SEGMENT 2 of 2 

SOURCE human. 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi ; 

15 Mammalia; Eutheria; Primates; Catarrhini; Hominidae; 

Homo. 

REFERENCE 1 (bases 1 to 2817) 

AUTHORS Kira.K.K., Youn^B.S., Heng,H.H., Shi,X.M., Tsui,L.C., 
Lee,Z,H., 

20 Pic]card,R.T. and Kwan,B.S. 

TITLE Genomic organization and FISH mapping of human Pmel 

17, the 

putative silver locus 
JOURNAL Pigment Cell Res. 9 (1), 42-48 (1996) 
25 MEDLINE 96314705 

REFERENCE 2 (bases 1 to 2817) 
AUTHORS Kwon,B.S. 
TITLE Direct Submission 

JOURNAL Submitted (05-JAN-1995) Indiana University School of 

30 Medicine, 

Microbiology and Immunology, 635 Barnhill Drive, 
Indianapolis, IN 

46202, USA 
FEATURES Location/Qualif iers 

35 source 1..2817 

/organisms «Homo sapi ens" 
/db_xref =*^taxon: 9606" 
/ chromosome= "12 " 
/map= ^ 12pter-q2 1" 

40 /clone_lib=« lambda FIX II from Stratagene" 

gene join (U19491 . 1 : 1111 . .3109,1.-2722) 

/gene="pmel 17" 

CDS 

join(U19491. 1:1111. .1186,67. .459 , 853 1014, 1266 2396 , 
45 2499.-2722) 

/gene="prael 17" 

/ codon_s tart=l 

/product =^Pmel 17" 

/protein_id=^^AAB19181 . 1" 
50 /db_xref =:«GI : 1142636" 

/ trans la t ion= ^^MDLVLKRCLLHLAVIGALLAVGATKVPRNQDWLGVSRQLRTKAWNRQLYPEW 
TEAQRLDCWRGGQVSLKVSNDGPTLIGANASFSIALKFPGSQKVLPDGQVIWVNNTIINGSQVWGG 
QPVYPQETDDACIFPDGGPCPSGSWSQKRSFVYWKTWGQYWQVLGGPVSGLSIGTGRAMLGTHTM 
55 EVTVYHRRGSRSYVPLAHSSSAFTITDQVPFSVSVSQLRALDGGNKHFLRNQPLTFALQLHDPSGY 
LAEADLSYTWDFGDSSGTLISRAPWTHTYLEPGPVTAQWLQAAIPLTSCGSSPVPGTTDGHRPT 
AEAPNTTAGQVPTTEWGTTPGQAPTAEPSGTTSVQVPTTEVISTAPVQMPTAESTGMTPEKVPVS 
EVMGTTLAEMSTPEATGMTPAEVSIWLSGTTAAQVTTTEWVETTARELPIPEPEGPDASSIMSTE 
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SITGSLGPLLDGTATLRLVKRQVPIJDCVLYRYGSFSVTIXJIVQGIESAEIL^ 
SCQGGLPKEACMEISSPGCQPPAQRLCQPVLPSPACQLVLHQILKGGSGTYCLNVSLADTNSIiAW 
STQLIMPGQEAGLGQVPLIVGILLVLMAVVLASLIYRRRLMKQDFSVPQLPHSS^ 
CPIGENSPLLSGQQV 
5 axon 67.. 459 

/gene=**pinel 17" 
/ntunberB2 
exon 853 . . 1014 

/gene=**pmel 17" 
10 /nutnber=3 
exon 1266. .2396 

/gene="pmel 17" 
/nuinber=4 
exon 2499. .>2722 

15 /gene=^*pmel 17" 

/number=5 

BASE COUNT 627 a 772 c 739 g 679 t 

ORIGIN 

1 gtgctaaaaa gatgccttct tcatttggct gtgataggtg ctttgtggct 
20 gtgggggcta 

61 caaaagtacc cagaaaccag gactggcttg gtgtctcaag gcaactcaga 
accaaagcct 

121 ggaacaggca gctgtatcca gagtggacag aagcccagag acttgactgc 
tggagaggtg 

25 181 gtcaagtgtc cctcaaggtc agtaatgatg ggcctacact gattggtgca 

aatgcctcct 

241 tctctattgc cttgaacttc cctggaagcc aaaaggtatt gccagatggg 
caggttatct 

301 gggtcaacaa taccatcatc aatgggagcc aggtgtgggg aggacagcca 
30 gtgtatcccc 

361 aggaaactga cgatgcctgc atcttccctg atggtggacc ttgcccatct 
ggctcttggt 

421 ctcagaagag aagctttgtt tatgtctgga agacctgggg tgagggactc 

ccttctcagc 

35 481 ctatcatcca cacttgtgtt tacttctttc tacctgatca cctttctttt 

ggccgcccct 

541 tccaccttaa cttctgtgat tttctctaat cttcattttc ctcttagatc 
ttttctcttt 

601 cttagcacct agcccccttc aagctctatc ataattcttt ctggcaactc 
40 ttggcctcaa 

661 ttgtagtcct accccatgga atgcctcatt aggacccctt ccctgtcccc 
ccatatcaca 

721 gccttccaaa caccctcaga agtaatcata cttcctgacc tcccatctcc 
agtgccgttt 

45 781 cgaagcctgt ccctcagtcc cctttgacca gtaatctctt cttccttgct 

tttcattcca 

841 aaaatgcttc aggccaatac tggcaagttc tagggggccc agtgtctggg 
ctgagcattg 

901 ggacaggcag ggcaatgctg ggcacacaca ccatggaagt gactgtctac 
50 catcgccggg 

961 gatcccggag ctatgtgcct cttgctcatt ccagctcagc cttcaccatt 
actggtaagg 

1021 gttcaggaag ggcaaggcca gttgtagggc aaagagaagg cagggaggct 

tggatggact 

55 1081 gcaaaggaga aaggtgaaat gctgtgcaaa cttaaagtag aagggccagg 

aagacctagg 

1141 cagagaaatg tgaggcttag tgccagtgaa gggccagcca gtcagcttgg 
agttggaggg 
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1201 tgtggctgtg aaaggagaag ctgtggctca ggcctggttc tcaccttttc 
tggctccaat 

1261 cccagaccag gtgcctttct ccgtgagcgt gtcccagttg cgggccttgg 
atggagggaa 

5 1321 caagcacttc ctgagaaatc agcctctgac ctttgccctc cagctccatg 

accccagtgg 

1381 ctatctggct gaagctgacc tctcctacac ctgggacttt ggagacagta 
gtggaaccct 

1441 gate tct egg gcacctgtgg teaetcatae ttacetggag cctggcecag 
10 tcactgccca 

1501 ggtggteetg caggctgcca ttcctcteac ctcctgtggc tcctccccag 

ttccaggcac 

1561 cacagatggg eacaggeeaa ctgeagaggc cectaacace aeagctggcc 
aagtgcctac 

15 1621 taeagaagtt gtgggtacta cacetggtea ggcgccaact gcagagccct 

etggaaccac 

1681 atctgtgeag gtgccaacca ctgaagtcat aagcactgca cetgtgeaga 
tgccaactgc 

1741 agagagcaca ggtatgacac etgagaaggt gecagtttca gaggtcatgg 
20 gtaccacaet 

1801 ggcagagatg tcaactccag aggctacagg tatgacacct gcagaggtat 
caattgtggt 

1861 gctttctgga accacagctg cacaggtaac aactacagag tgggtggaga 
ccacagctag 

25 1921 agagctacct atccctgagc ctgaaggtcc agatgccagc tcaatcatgt 

ctacggaaag 

1981 tattacaggt tccctgggcc ccctgctgga tggtacagcc accttaaggc 
tggtgaagag 

2041 acaagtcccc ctggattgtg ttctgtatcg atatggttec ttttecgtea 
30 ccctggacat 

2101 tgtccagggt attgaaagtg ccgagatect gcaggctgtg cegtecggtg 

agggggatgc 

2161 atttgagetg actgtgtcct gccaaggcgg getgcccaag gaagcctgca 

tggagatctc 

35 2221 ategeeaggg tgccagcccc ctgcccagcg gctgtgccag cctgtgctac 

ccagcccagc 

2281 ctgccagctg gttctgcacc agatactgaa gggtggctcg gggaeatact 
gcctcaatgt 

2341 gtctctgget gataccaaca gcctggcagt ggtcagcacc cagcttatca 
40 tgcctggtag 

2401 gtecttggac agagactaag tgaggaggga agtggataga ggggacagct 
ggcaagcagc 

2461 agacatgagt gaagcagtgc ctgggattct tctcacaggt caagaagcag 
gccttgggca 

45 2521 ggttccgctg atcgtgggca tcttgctggt gttgatggct gtggtccttg 

catctctgat 

2581 atataggcgc agacttatga agcaagactt ctccgtaccc cagttgccac 
atagcagcag 

2641 tcactggctg cgtctacccc gcatcttctg ctcttgtccc attggtgaga 
50 atagccccct 

2701 ectcagtggg cagcaggtct gagtactetc atatgatgct gtgattttcc 
tggagttgac 

2761 agaaaeaeet atatttcccc eagtcttecc tgggagacta ctattaactg 
aaataaa 
55 // 



NM_001648. Homo sapiens kail . . . [gi :4502172] 
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LOCUS NM_0 01648 1466 bp mRNA PRI 

31-OCT-2000 

DEFINITION Homo sapiens kallikrein 3, (prostate specific antigen) 
(KLK3) , 

5 mRNA. 
ACCESSION NM_001648 

VERSION NM_001648.1 GI:4502172 

KEYWORDS 

SOURCE human . 

10 ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; 

Vertebrata; Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; 

Homini dae ; Homo . 
15 REFERENCE 1 (bases 1 to 1466) 

AUTHORS Lundwall,A. and Lilja,H. 

TITLE Molecular cloning of human prostate specific antigen 

cDNA 

JOURNAL FEES Lett. 214 (2), 317-322 (1987) 
20 MEDLINE 87190978 

REFERENCE 2 (bases 1 to 1466) 

AUTHORS Sutherland GR, Baker E, Hyland VJ, Callen DF, Close OA, 

Tregear GW, 

Evans BA and Richards RI . 
25 TITLE Hiiraan prostate -specific antigen (APS) is a member of 

the glandular 

kallikrein gene family at 19ql3 
JOURNAL Cytogenet, Cell Genet- 48 (4), 205-207 (1988) 
MEDLINE 89250658 
30 PUBMED 2470553 

REFERENCE 3 (bases 1 to 1466) 

AUTHORS Riegman PH, Klaassen P, van der Korput JA, Romijn JC 
and Trapman J. 

TITLE Molecular cloning and characterization of novel 

35 prostate antigen 

cDNA's 

JOURNAL Biochem. Biophys. Res. Commun. 155 (1), 181-188 (1988) 
MEDLINE 88326297 
PUBMED 2458104 
40 REFERENCE 4 (bases 1 to 1466) 

AUTHORS Schulz P, Stucka R, Feldmann H, Cotnbriato G, Klobeck HG 
and Fittler F. 

TITLE Sequence of a cDNA clone encompassing the complete 

mature human 

45 prostate specific antigen (PSA) and an \anspliced 

leader sequence 

JOURNAL Nucleic Acids Res. 16 (13), 6226 (1988) 
MEDLINE 88289366 
PUBMED 2456523 
50 REFERENCE 5 (bases 1 to 1466) 

AUTHORS Riegman PH, Vlietstra RJ, van der Korput JA, Romijn JC 
and Trapman J. 

TITLE Characterization of the prostate-specific antigen gene: 

a novel 

55 human kallikrein- like gene 

JOURNAL Biochem. Biophys. Res. Commun. 159 (1), 95-102 (1989) 
MEDLINE 89165891 
PUBMED 2466464 
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REFERENCE 6 (bsLses 1 to 1466) 
AUTHORS Henttu P and Vihko P. 

TITLE cDNA coding for the entire human prostate specific 

antigen shows 

5 high homologies to the human tissue kallikrein 

genes /-inon* 
JOURNAL Biochem. Biophys . Res. Commirn. 160 (2), 903-910 (1989) 

MEDLINE 89246551 
PUBMED 2470373 
10 REFERENCE 7 (bases 1 to 1466) 

AUTHORS Klobeck HG, Corribriato G, Schulz P, Arbusow V and 
Fittler F. . 
TITLE Genomic sequence of human prostate specific antigen 

(PSA) 

15 JOURNAL Nucleic Acids Res. 17 (10), 3981 (1989) 

MEDLINE 89282407 
PUBMED 2471958 
REFERENCE 8 (bases 1 to 1466) 

AUTHORS Lundwall A. , . 

20 TITLE Characterization of the gene for prostate-specific 

antigen, a human 

glandular kallikrein 
JOURNAL Biochem. Biophys. Res. Commian. 161 (3), 1151-1159 
(1989) 

25 MEDLINE 89302090 

PUBMED 2472789 

COMMENT PROVISIONAL REFSEQ: This record has not yet been 

subject to final 

NCBI review. The reference sequence was derived 

30 from X05332.1. 

FEATURE S Loca t ion/Qual i f i er s 
source 1..1466 

/organisra=*'Homo sapiens" 

/ db_xr e f = t axon : 9 6 0 6 " 
35 /chromosome«**19" 

/map=«19ql3'' 

/ clone- ^lambda HPSA-l" 

/tissue^types^'prostate" 

/clone_lib=^ (lambda) gtll'' 
40 gene i . - 1466 

/gene^^KLKS" 

/note="PSA; APS" 

/db_xref=«LocusID:354'' 

/db_xref=«MIM: 176820" 
45 sig_peptide 44.. 94 

CDS 44.. 829 

/gene^^KLra" 

/EC_niunber=«3 . 4 . 21 . 77" 

/note='*antigen, prostate specific" 
50 /codon_start=l 

/db_xref= "Locus ID: 354" 

/db_xref = «MIM : 1 7 6 8 2 0 " 

/product=^*kallikrein 3, (prostate specific 

antigen) " 

55 /protein_id=^^NP_001639.1" 

/db xref=«GI: 4502173" 
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/traiislation='*MWVPWFLTLSVTWIGAAPLIIiSRIVGGWECEKHSQPWQVLVAS 

RGRAVCGGVLVHPQWVLTAAHCIRNKSVILLGRHSLFHPEDTGQVFQVSHSFPHPLYDMSLLKNRF 

LRPGDDSSHDIjMLLRLSEPAELTDAVKVI^LPTQEPALGTTCYASGWGSIEPEEFLTPKKLQCTO 

HVTSNDVCAQVHPQKVTKS'MLa\GRWTGGKSTCSGDSGGPLVClfGVIiQGIT^ 

YTKWHYRKtJIKDTIVANP" 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



variation 

variation 

misc__f eature 
variation 

raisc_feature 

protease" 

misc_feature 

raat_peptide 

antigen) " 

variation 

variation 



variation 



variation 



variation 



variation 



variation 



variation 



91 



91 

/allele^^C" 
/allele="T'' 

/db_xref=«dbSNP: 1135765" 

/allele=*C'' 
/allele=""T'" 

/db_xref=«dbSNP: 11573" 
95. .115 

/note=*propeptide (AA 1-7)^ 

97 

/allele="G'' 
/allele=*A'' 

/db_xref=«dbSNP: 113 5766" 
113. .802 

/note=*Tryp_SPc; Region: Trypsin-like serine 
116. .802 

/note='* trypsin; Region: Trypsin" 
116.. 826 

/products "kallikrein 3, (prostate specific 

280 
/allele=^C" 
/allele="T" 

/db_xref=«dbSNP: 1058072" 
280 

/allele=*C'' 
/allele='T'' 

/db_xref=~dbSNP: 12946" 

526 
/alleles ^A" 
/allele='^G" 

/db_xref="dbSNP: 1803130" 
643 

/allele=^A" 
/allele^^G" 

/db_xref=*dbSNP: 1803129" 
788 

/note=**WARNING: map location airibiguous" 

/alleles ^*T" 
/allele="C" 

/db_xref«^dbSNP: 1802720" 
844 
/allele="T" 
/allele="C" 

/db_xref=«dbSNP: 1058205" 
894 

/allele=^W 
/allele="G" 

/db_xref=*dbSNP: 1058274" 
906 
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55 



variation 

variation 

variation 

variation 

variation 

variation 

variation 

variation 

variation 

variation 

variation 

variation 

variation 

misc_f eature 
polyA_site 



/allele=«T" 
/allele=«A'' 

/db_xref='^dbSNP:1803132'' 
918 

/allele=^*T*' 
/aHele= 

/dbjjcref =^^dbSNP : 1802722' 
953 

/allele=^^A" 
/alleles ^^G" 

/db__xref="dbSNP: 1802721*' 
1017 

/allele=*^T" 
/allele=^^C'' 

/db_xref = «dbSNP : 1803138'' 
1074 

/allele^^^C 
/allele=«T'' 

/db_xre f = '^dbSNP : 12 10 1 " 
1107 

/allele="A" 
/allele="G" 

/db^xref = "dbSNP : 1803131" 
1107 

/allele=^^G" 
/allele=^W 
/db_xref = ^^dbSNP : 6 9 98" 
1211 

/allele=«C" 
/allele=^T'' 

/db^xref =^^dbSNP : 1803136" 

1229 
/allele="A'' 
/allele=^^G" 

/db_xref=«dbSNP: 1803135" 

1249 

/allele="T" 
/allele=^^C" 

/db_xref =«dbSNP : 1803134" 
1273 

/allele='*A" 
/allele="G" 

/db_xref=^dbSNP: 12040" 
1326 

/allele="T" 
/alleles "C" 

/db__xref = ^^dbSNP : 1803 133 " 
1344 

/allele='*A" 
/allele=«G" 

/db_xref=^*dbSNP: 12863" 
1363 

/allele=«A" 
/allele=«G" 

/db_xref=«dbSNP: 1803 137" 
1445. .1450 

/note=**put.polyA signal" 
1466 
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/note=*'polyA site" 
BASE COUNT 338 a 382 c 422 g 324 t 

ORIGIN 

1 agccccaagc ttaccacctg cacccggaga gctgtgtgtc accatgtggg 

5 tcccggttgt 

61 cttcctcacc ctgtccgtga cgtggattgg tgctgcaccc ctcatcctgt 

ctcggattgt 

121 gggaggctgg gagtgcgaga agcattccca accctggcag gtgcttgtgg 
cctctcgtgg 

10 181 cagggcagtc tgcggcggtg ttctggtgca cccccagtgg gtcctcacag 

ctgcccactg 

241 catcaggaac aaaagcgtga tcttgctggg tcggcacagc ctgtttcatc 
ctgaagacac 

301 aggccaggta tttcaggtca gccacagctt cccacacccg ctctacgata 
15 tgagcctcct 

361 gaagaatcga ttcctcaggc caggtgatga ctccagccac gacctcatgc 

tgctccgcct 

421 gtcagagcct gccgagctca cggatgctgt gaaggtcatg gacctgccca 
cccaggagcc 

20 481 agcactgggg accacctgct acgcctcagg ctggggcagc attgaaccag 

aggagttctt 

541 gaccccaaag aaacttcagt gtgtggacct ccatgttatt tccaatgacg 
tgtgtgcgca 

601 agttcaccct cagaaggtga ccaagttcat gctgtgtgct ggacgctgga 
25 cagggggcaa 

661 aagcacctgc tcgggtgatt ctgggggccc acttgtctgt aatggtgtgc 

ttcaaggtat 

721 cacgtcatgg ggcagtgaac catgtgccct gcccgaaagg ccttccctgt 
acaccaaggt 

30 781 ggtgcattac cggaagtgga tcaaggacac catcgtggcc aacccctgag 

cacccctatc 

841 aaccccctat tgtagtaaac ttggaacctt ggaaatgacc aggccaagac 
tcaagcctcc 

901 ccagttctac tgacctttgt ccttaggtgt gaggtccagg gttgctagga 
35 aaagaaatca 

961 gcagacacag gtgtagacca gagtgtttct taaatggtgt aattttgtcc 
tctctgtgtc 

1021 ctggggaata ctggccatgc ctggagacat atcactcaat ttctctgagg 
acacagatag 

40 1081 gatggggtgt ctgtgttatt tgtggggtac agagatgaaa gaggggtggg 

atccacactg 

1141 agagagtgga gagtgacatg tgctggacac tgtccatgaa gcactgagca 

gaagctggag 

1201 gcacaacgca ccagacactc acagcaagga tggagctgaa aacataaccc 
45 actctgtcct 

1261 ggaggcactg ggaagcctag agaaggctgt gagccaagga gggagggtct 

tcctttggca 

1321 tgggatgggg atgaagtaag gagagggact ggaccccctg gaagctgatt 
cactatgggg 

50 1381 ggaggtgtat tgaagtcctc cagacaaccc tcagatttga tgatttccta 

gtagaactca 

1441 cagaaataaa gagctgttat actgtg 

// 

55 

D87459. Human autoiramunog . . , [gi: 1890098] 

LOCUS HSU87459 752 bp mRNA PRI 

22-DEC-1999 
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Chordata; 
Primates; 



Craniata; 
Catarrhini ; 



Sahin.U. , Tureci,0. , 
Pf reundschoih , M • and 



DEFINITION Human autoimmunogenic cancer/testis antigen NY-ESO- 
1 TtiRNA, complete cds. 
ACCESSION U87459 

VERSION U87459.1 GI: 1890098 

KEYWORDS 

SOURCE human. 
ORGANISM Homo sapiens 

EuJcaryota; Metazoa; 
Vertebrata; Euteleostomi; 

Mammalia; Eutheria; 

Homi ni dae ; Homo . 
REFERENCE 1 (bases 1 to 752) 
AUTHORS Chen.Y.T,, Scanlan^M. J. , 
Cure , A . O . , Tsang , S . , 

Williamson^ B. , Stockert.E. , 

Old, L.J. 

TITLE A testicular antigen aberrantly expressed in human 

cancers detected 

by autologous antibody screening 
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (5), 1914-1918 (1997) 
MEDLINE 97203161 
PUBMED 9050879 
REFERENCE 2 (bases 1 to 752) 
AUTHORS Chen , Y . -T . 
TITLE Direct Submission 

JOURNAL Submitted (28-JAN-1997) Ludwig Institute for Cancer 
Research, New 

York Branch/ 1275 York Avenue, New York, NY 10021, 

USA 

FEATURES Location/Qualifiers 
source 1..752 

/organi sm= ^^Homo s ap i ens " 
/db_xref=^^taxon:9606" 
/ c lone__l ib = ^ e s ophageal 

carcinoma expression 

cDNA library" 
CDS 54.. 596 

/codon_start=l 
/products '^autoimmunogenic 

antigen NY-ESO-1" 

/protein_id=*^AAB49693 . 1" 
/db_xref= '^01:1890099'' 



squamous 



cell 



cancer/testis 



/translation- "MQAEGRGTGGSTGDADGPGGPGIPDGPGGNAGGPGEAG 
ATGGRGPRGAGAARASGPGGGAPRGPHGGAASGLNGCCRCGARGPBSRLLEF 
YLAMPFATPMEAELARRSLAQDAPPLPVPGVLLKEFTVSGNILTIRLTAADH 
RQLQLSISSCLQQLSLLMWITQCFLPVFLAQPPSGQRR" 

BASE COUNT 126 a 230 c 256 g 140 t 

ORIGIN 

1 atcctcgtgg gccctgacct tctctctgag agccgggcag aggctccgga 
gccatgcagg 

61 ccgaaggccg gggcacaggg ggttcgacgg gcgatgctga tggcccagga 
ggccctggca 

121 ttcctgatgg cccagggggc aatgctggcg gcccaggaga ggcgggtgcc 
acgggcggca 

181 gaggtccccg gggcgcaggg gcagcaaggg cctcggggcc gggaggaggc 
gccccgcggg 
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cggcgcggct 


tcagggctga 


atggatgctg 


cagatgcggg 


cctgcttgag 


ttctacctcg 


ccatgccttt 


cgcgacaccc 


caggagcctg 


gcccaggatg 


ccccaccgct 


tcccgtgcca 


cactgtgtcc 


ggcaacatac 


t crac fc a fr* fvi 




gctctccatc 


agctcctgtc 


tccagcagct 


ttccctgttg 


tctgcccgtg 


tttttggctc 


agcctccctc 


agggcagagg 


cccttcctag 


gtcatgcctc 


ctcccctagg 


gaatggtccc 


tgtgggggcc 


tgattgfcttg 


tcgctggagg 


aggacggctt 



gccagggggc 

301 cggagagc 
atggaagcag 

361 agctggcc 
ggggtgcttc 

421 tgaaggag 
gcagaccacc 

481 gccaactg^ 
atgtggatca 

541 cgcagtgd 
cgctaagccc 

601 agcctggcf 
agcacgagtg 

661 gccagttc< 
acatgtttgt 

721 ttctgtagaa aataaaactg agctacgaaa aa 

CAA11116. LAGE-la protein [... [gi : 3255959] 

LOCUS CAA11116 180 aa ppr 

23-JUN-1998 

DEFINITION LAGE-la protein iHomo sapiens] . 

ACCESSION CAA11116 

PID g3255959 

VERSION CAA11116.1 GI: 3255959 

DBSOURCE enibl locus HOS223093, accession AJ223093 1 

KEYWORDS 

SOURCE human. 
ORGANISM Homo sapiens 

„ ^ ^ Eukaryota; Metazoa; Chordata; Craniata; 

Vertebrata; Euteleostorai; 

. Mammalia; Eutheria; Primates; Catarrhini; 

Hoimnidae; Homo. 

REFERENCE 1 (residues 1 to 180) 

AUTHORS Lethe, B., Lucas.S., Michaux.L., De Smet,C., 
Goaelame/D- , , 

Serrano, A., De Plaen^E. and Boon,T. 

TITLE LAGE-1, a new gene with tumor specificity 

JOURNAL Int. J. Cancer 76 [6), 903-908 (1998) 

MEDLINE 98289662 

REFERENCE 2 (residues 1 to 180) 

AUTHORS Lethe, E.G. 

TITLE Direct Submission 

JOURNAL Submitted (08-JAN-1998) Lethe E.G., Brussels Branch 
Ludwig . ' 

^^titute for Cancer Research, 74 Avenue 
Hippocrate, B - 1200 - 

Briaxelles, BELGIUM 
Related sequences: AJ223040, AJ223041 and AJ003149, 
FEATURES Location/Qualifiers 
source 1..180 

/organism^^^Homo sapiens" 

/isolate^ 'individual LB33'' 

/db_xref=«taxon;9606" 

/ chromosome = "X" 

/map=^q28" 

/clone="c2RB" 
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/sex== "^Female" 
/cell_line= ^^LB3 3 -MELA" 
/ cell_type= "melanoma" 
/clone_lib= '^Be* 
5 Protein 1..180 

/products ''IiAGE-la protein" 
CDS 1..180 

/gene="LAGE-l" 
/ db_xr e f = " S PTREMBL : 07 5 6 3 7 
10 / coded_by = j o in ( A J2 23093. 1:1231. .1499, 

AJ223093. 1:2115. .2249, AJ223093.1; 2479. .2617)" 
ORIGIN 

1 mgaegrgtgg stgdadgpgg pgipdgpggn aggpgeagat ggrgprgaga 
15 arasgprgga 

61 prgphggaas aqdgrcpcga rrpdsrllel hitmpfsspra eaelvxrils 
rdaaplprpg 

121 avlkdftvsg nllfirltaa dhrqlqlsis sclqqlsllm witqcflpvf 
laqapsgqrr 
20 181 

// 

CAA11117, LAGE- lb protein [ . . . [gi :3255960] 

LOCUS CAA11117 210 aa PRI 

25 23-JUN-1998 

DEFINITION LAGE- lb protein [Homo sapiens] . 

ACCESSION CAA11117 

FID g3255960 

VERSION CAA11117,1 GI: 3255960 

30 DBSOURCE embl locus HOS223093, accession AJ2 23 093 . 1 

KEYWORDS 

SOURCE human. 
ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; 

35 Vertebrata; Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; 

Hominidae ; Homo . 

REFERENCE 1 (residues 1 to 210) 

AUTHORS Lethe , B . , Lucas , S . , Michaux , L . , De Smet , C . , 
40 Godelaine , D . , 

Serrano, A., De Plaen,E. and Boon,T. 
TITLE LAGE-1, a new gene with tumor specificity 

JOURNAL Int. J. Cancer 76 (6), 903-908 (1998) 
MEDLINE 98289662 
45 REFERENCE 2 (residues 1 to 210) 

AUTHORS Le t he , B . G , 
TITLE Direct Submission 

JOURNAL Submitted (08- JAN-1998) Lethe B.C., Brussels Branch, 
Ludwig 

50 Institute for Cancer Research, 74 Avenue 

Hippocrate, B - 1200 - 

Bruxelles, BELGIUM 
COMMENT Related sequences: AJ223040, AJ223041 and AJ003149. 
FEATURES Location/Qualifiers 
55 source 1 . . 210 

/organism=''Homo sapiens" 
/isolate=^Hndividual LB33'' 
/db xref=«taxon:9606'' 
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Protein 

CDS 



2 boamdary" 
ORIGIN 

1 mqaegrgtgg 
arasgprgga 

61 prgphggaas 
rdaaplprpg 

121 avlkdftvsg 
pkhkvseqrp 

181 gtpgppppeg 

11 



/chroTnosoTne= '"X" 
/map^^qae" 
/clones ''c2RB" 
/sex=^*Female* 
/eel 1_1 ine= *IjB3 3 -MELA" 
/c el l__t ype= ^*me 1 anoma* 
/clone_lib= *^B6" 
1. .210 

/products ^LAGE- lb protein" 
1. .210 

/gene=*'LiAGE-l" 
/db_xre f = SPTREMBL : 07 5 6 3 8 " 
/coded_by=^*join(AJ223093 .1;1231. .1499, 
AiJ223093. 1:2115. .2478)" 

/note=*altemate splicing at exon 2 - intron 

stgdadgpgg pgipdgpggn aggpgeagat ggrgprgaga 

aqdgrcpcga rrpdsrllel hitmpfsspm eaelvrrils 

nllfmsvwdq dregagrmrv vgwglgsasp egqkardlrt 
aqgdgcrgva fnvmfsaphi 



30 M77481. Human antigen (MA [gi:416114] 

LOCUS HUMMAGIA 2420 bp DNA PRI 

15-N0V-1993 

DEFINITION Human antigen (MAGE-1) gene, conplete cds. 
ACCESSION M77481 
35 VERSION M77481.1 GI: 416114 

KEYWORDS antigen. 

SOURCE Homo sapiens (individual_isolate patient MZ2) 

melanoma abdominal 

metastasis of melanoma DNA. 

40 ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; 

Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; 

Hominidae ; Homo . 
45 REFERENCE 1 (bases 785 to 1286) 

AUTHORS van der Bruggen,P., Traversari , C . , Chomez^P., 
Lurquin,C., De 

Plaen,E., Van den Eynde^B., Knuth/A. and Boon,T. 
TITLE A gene encoding an antigen recognized by cytolytic T 

50 lymphocytes on 

a human melanoma 
JOURNAL Science 254, 1643-1647 (1991) 
MEDLINE 92086861 
REFERENCE 2 (bases 1 to 2420) 
55 AUTHORS van der Bruggen P. 

TITLE Direct Submission 

JOURNAL Submitted (05-FEB-1992) Pierre van der Bruggen, Ludwig 
Institute 
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for Cancer Research, Bmassels Branch, Avenue 
Hippocrate, 74, UCL 

7459, Brussels, B-1200, Belgium 
COMMENT On Nov 15, 1993 this sequence version replaced 

5 gi: 187294, 

FEATURES Locat ion/Qua li f iers 
source 1..2420 

/organisTn=!'*Homo sapiens" 
/db_xref=*taxon:9606'' 
10 /map=^X" 

/sexs** female" 
/cell_line= '^MZ2 -MEL . 43 " 
intron 1..412 

/partial 

15 /gene=**MAGE-l" 

/numbers 1 
exon 413., 485 

/gene=^MAGE-l'' 
/numbers 2 

20 mRNA join{413, .485,561. .2111) 

/partial 
/gene=^^MAGE-l" 
intron 486.. 560 

/gene="MAGE-l" 
25 /numbers 2 

exon 561., 2 111 

gene«^MAGE-l" 
/number=3 
gene 626.. 1555 

30 /gene=*^MAGE-l" 
CDS 626. .1555 

/gene=^MAGE-l" 
/codon_start=l 
/pr oduc t = "MAGE - 1 " 
35 /protein_id="AAA03229 .1" 

/db_xref="GI: 416115" 

/translations ^^MSLEQRSLHCKPEEALEAQQEALGLVCVQAATSSSSPL 
VLGTLEEVPTAGSTDPPQSPQGASAFPTTINFTRQRQPSEGSSSREEEGPST 
40 SCILESLFRAVITKKVADLVGFIiLLKYRAREPVTKAEMLESVIKNYKHCFPE 
IFGKASESLQLVFGIDVKEADPTGHSYVLVTCLGLSYDGLLGDNQIMPKTGF 
LIIVLVMIAMEGGHAPEEEIWEELSVMEVYDGREHSAYGEPRKLLTQDLVQE 
KYLEYRQVPDSDPARYEFLWGPRALAETSYVKVIiEYVIKVSARVRFFFPSLR 
EAALREEEEGV" 
45 polyA^signal 2095.. 2100 

/gene=«MAGE-l" 
intron 2112.. 2420 

/partial 
gene^^mGE-l" 
50 /number=3 

BASE COUNT 562 a 582 c 677 g 599 t 

ORIGIN 

1 ggatccaggc cctgccagga aaaatataag ggccctgcgt gagaacagag 

ggggtcatcc 

55 61 actgcatgag agtggggatg tcacagagtc cagcccaccc tcctggtagc 

actgagaagc 

121 cagggctgtg cttgcggtct gcaccctgag ggcccgtgga ttcctcttcc 
tggagctcca 
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181 ggaaccaggc agtgaggcct 
gcagaggatg 

241 cacagggtgt gccagcagtg 
ccacctgcca 
5 3 01 caggacacat aggactccac 

cctgtagaat 

361 cgacctctgc tggccggctg 
aggttttcag 

421 gggacaggcc aacccagagg 
10 ccaaggagaa 

481 gatctgtaag taggcctttg 
aggcctctca 

541 cacactccct ctctccccag 
ccacactcct 
15 601 gcctgctgcc ctgacgagag 

actgcaagcc 

661 tgaggaagcc cttgaggccc 
aggctgccac 

721 ctcctcctcc tctcctctgg 
20 ctgggtcaac 

781 agatcctccc cagagtcctc 
acttcactcg 

841 acagaggcaa cccagtgagg 
gcacctcttg 
25 901 tatcctggag tccttgttcc 

tggttggttt 

961 tctgctcctc aaatatcgag 
tggagagtgt 

1021 catcaaaaat tacaagcact 
30 agtccttgca 

1081 gctggtcttt ggcattgacg 
atgtccttgt 

1141 cacctgccta ggtctctcct 
tgcccaagac 
35 1201 aggcttcctg ataattgtcc 

ctcctgagga 

1261 ggaaatctgg gaggagctga 
acagtgccta 

1321 tggggagccc aggaagctgc 
40 tggagtaccg 

1381 gcaggtgccg gacagtgatc 
gggccctcgc 

1441 tgaaaccagc tatgtgaaag 
gagttcgctt 
45 1501 tttcttccca tccctgcgtg 

tctgagcatg 

1561 agttgcagcc aaggccagtg 
ggccgcgtcc 

1621 agcagcttcc cctgcctcgt 
50 agagcggtca 

1681 gtgttctcag tagtaggttt 
tttgttctct 

1741 tttggaattg ttcaaatgtt 
gcatccaagt 
55 1801 ttatgaatga cagcagtcac 

agtcttgtgt 

1861 tttattcaga ttgggaaatc 
gcagtggaat 



tggtctgaga 


cagtatcctc 


aggtcacaga 


aatgtttgcc 


ctgaatgcac 


accaagggcc 


agagtctggc 


ctcacctccc 


tactgtcagt 


taccctgagt 


accctctcac 


ttcctccttc 


acaggattcc 


ctggaggcca 


cagaggagca 


ttagagtctc 


caaggttcag 


ttctcagctg 


gcctgtgggt 


cttcattgcc 


cage tec tgc 


tcatcatgtc 


tcttgagcag 


aggagtctgc 


aacaagaggc 


cctgggcctg 


gtgtgtgtgc 


tcctgggcac 


cctggaggag 


gtgcccactg 


agggagcctc 


cgcctttccc 


actaccatca 


gttccagcag 


ccgtgaagag 


gaggggccaa 


gagcagtaat 


cactaagaag 


gtggctgatt 


ccagggagcc 


agtcacaaag 


gcagaaatgc 


gttttcctga 


gatcttcggc 


aaagcctctg 


tgaaggaagc 


agaccccacc 


ggccactcct 


atgatggcct 


gctgggtgat 


aatcagatca 


tggtcatgat 


tgcaatggag 


ggcggccatg 


gtgtgatgga 


ggtgtatgat 


gggagggagc 


tcacccaaga 


tttggtgcag 


gaaaagtacc 


ccgcacgcta 


tgagttcctg 


tggggtccaa 


tccttgagta 


tgtgatcaag 


gtcagtgcaa 


aagcagcttt 


gagagaggag 


gaagagggag 


ggagggggac 


tgggccagtg 


caccttccag 


gtgacatgag 


gcccattctt 


cactctgaag 


ctgttctatt 


gggtgacttg 


gagatttatc 


tttttttaag 


ggatggttga 


atgaacttca 


acagttctgt 


gtatatagtt 


taagggtaag 


cattctattt 


tgtgaattgg 


gataataaca 
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1921 aagtacttag 
ctaaagaaat 

1981 taagagatag 
ttaaagatat 

2041 atgcatacct 
tctgaataaa 

2101 gaattcttcc 
ctttttggaa 

2151 ggccctgggt 
ccatagggtc 

2221 gtagagtcta 
aaagatgtag 

2281 ggaaaagtga 
agtgtcaatg 

2341 ccctgagctg 
gagctgattg 

2401 taatgatctt 

II 



aaatgtgaaa 


aatgagcagt 


tcaattcttg 


ccttatacct 


ggatttcctt 


ggcttctttg 


tgttcactgg 


ctcttttctt 


tagtagtgga 


gatgctaagg 


ggagctgcag 


tcacgtaatc 


gagaggggtg 


agggtgtggg 


gggcattttg 


ggctttggga 


ggtggatcc 





aaaatagatg agataaagaa 

cagtctattc tgtaaaattt 

agaatgtaag agaaattaaa 

ctccatgcac tgagcatctg 

taagccagac tcatacccac 

gaggtggcaa gatgtcctct 

gctccgggtg agagtggtgg 

aactgcagtt ccttctgggg 



L18920. Human MAGE -2 gene . . . Igi :436180] 

LOCUS HUMMAGE2X 4559 bp DNA PRI 

20-APR-1994 

DEFINITION Human MAGE-2 gene exons 1-4, complete cds. 

ACCESSION LI 8 920 

VERSION L18920.1 GI: 436180 

KEYWORDS 

SOURCE Homo sapiens {human) . 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; 

Vertebrata / Euteleostomi 

Mammalia ; Eutheria ; Primates ; Catarrhini ; 

Hominidae ; Homo , 

REFERENCE 1 {bases 1 to 4559) 

AUTHORS De Smet,C., Lurguin.C, van der Bruggen,P., De 
Plaen/E. , 

Brassexir,F. and Boon,T. 
TITLE Sequence and expression pattern of the human MAGE2 gene 

JOURNAL Iramunogenetics 39 (2), 121-129 (1994) 
MEDLINE 94102805 
FEATURES Location/ Qual i f iers 



source 



gene 
exon 

intron 

exon 



1. .4559 

/organism=''Homo sapiens" 
/isolate=^^MZ-2" 
/db_xref='»taxon:9606'' 
/map=^*X" 

/haplotype="Al A29 B37 B44 06" 

/sex= '^female" 

/cell_line=«MZ2-MEL . 43* 

/cell_type=»*melanoma cell line" 
561. .4496 

/gene=*MAGE-2" 
561. .754 

/gene=^r4AGE-2" 

/number =1 
755. .1850 

/gene=''MAGE-2" 

/number =1 
1851. .1969 
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/gene="MAGE-2" 



intron 



/number =2 
1970, .2788 



/gene=«MAGE-2* 



exon 



/nutiiber=2 
2789. .2854 



/gene=«MAGE-2'' 



intron 



/numbers 3 
2855.-2934 



/gene=^^MAGE-2^ 
/number =3 



exon 



2935. .4496 
/gene=''MAGE-2'' 
/number =4 



CDS 



3000. .3944 

/gene=''MAGE-2" 
/codon_start=l 
/protein_id= *AAA17729 . 1" 
/cib_xref = *GI : 43 618 1" 



/translation='*MPLEQRSQHCKPEEGLEARGEALGLVGAQAPATEEQQTASSSSTLVEVT 

LGEVPAADSPSPPHSPQGASSFSTTINYTLWRQSDEGSSNQEEEGPRMFPDLE 

SEFQAMSRK^^l^ELVHPLLLKYRAREPVTKAEMLESVLRNCQDFFPVIFSKASEYLQLVFGIE 

WEWPISHLYILVTCLGLSYDGLLGDNQWPKTGLLIIVLAIIAIEGDCAPEEKIWEELSML 

EVFEGREDSVFAHPRKLLMQDLVQENYLEyRQVPGSDPACYEFLWGPR?^IETSYVKV^ 

KIGGEPHISYPPLHERALREGEE" 

polyA_signal 4479. .4484 

/gene=*MAGE-2'' 
BASE COUNT 1049 a 1280 c 1283 g 947 t 

ORIGIN 

1 attccttcat caaacagcca ggagtgagga agaggaccct cctgagtgag 

gactgaggat 

61 ccaccctcac cacatagtgg gaccacagaa tccagctcag cccctcttgt 

cagccctggt 

121 acacactggc aatgatctca ccccgagcac acccctcccc ccaatgccac 

ttcgggccga 

181 ctcagagtca gagacttggt ctgaggggag cagacacaat cggcagagga 
tggcggtcca 

241 ggctcagtct ggcatccaag tcaggacctt gagggatgac caaaggcccc 
tcccaccccc 

301 aactcccccg accccaccag gatctacagc ctcaggatcc ccgtcccaat 
ccctacccct 

361 acaccaacac catcttcatg cttaccccca cccccccatc cagatcccca 
tccgggcaga 

421 atccggttcc acccttgccg tgaacccagg gaagtcacgg gcccggatgt 
gacgccactg 

481 acttgcacat tggaggtcag aggacagcga gattctcgcc ctgagcaacg 
gcctgacgtc 

541 ggcggaggga agcaggcgca ggctccgtga ggaggcaagg taagacgccg 
agggaggact 

601 gaggcgggcc tcaccccaga cagagggccc ccaataatcc agcgctgcct 
ctgctgccgg 

661 gcctggacca ccctgcaggg gaagacttct caggctcagt cgccaccacc 
tcaccccgcc 

721 accccccgcc gctttaaccg cagggaactc tggcgtaaga gctttgtgtg 

accagggcag 

781 ggctggttag aagtgctcag ggcccagact cagccaggaa tcaaggtcag 
gaccccaaga 
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841 ggggactgag ggcaacccac 
aacaccaacc 

901 ccacccccat ccctcaaaca 
tcctccccca 
5 961 ccaccatcct ggcagaatcc 

ctccgggaat 

1021 ggcggccaag cacgcggatc 

ci^gsfggttgg 

1081 gtctcgtgag tatggccttt 
10 gaagacagtg 

1141 gagtccttag gggacccagc 
gtctcaaact 

1201 gagccacctt ttcattcagc 
cagcaggggg 
15 1261 ttggggccca gcctgcgagg 

agggg3.cctt 

1321 ggagtccaga tcagtggcaa 
ccgaatgtgc 

1381 cccgtgctca ttgcaccttc 
20 gagggctggg 

1441 acttcaggtc agcagaggga 
tgcccccttc 

1501 atgaggactg gggatacccc 
tggaagtccc 
25 1561 ttgttcttag ctctggggga 

ctcatttgta 

1621 ccacaggcag gaggttgggg 
3-gaggagctg 

1681 tctgctcatt tcagggggtt 
30 agtaaagatg 

1741 agtaacccac aggaggccat 
agccctggac 

1801 aacgcacgtg ggggtaacag 
atctcaggga 
35 1861 gttgatgacc ttgttttcag 

tctggtcgac 

1921 agatgcagtg gttctaggat 
taggattgag 

1981 ggtacccctg ggccagaactg 
40 gcccctgcgg 

2041 ttacttcaga gaccctgggc 
ctgggatctt 

2101 tgatgtcagg gaaggggagg 
gtagagggag 
45 2161 ggtctcaggc cctgccagga 

ccaggacacc 

22 21 tggactccaa tgaatttgga 
ggtcacgtat 

22 81 ggccagatgt gggtcccctc 
50 gttcttgaca 

2341 tgagagattc tcaagccagc 
aaggtgaggg 

2401 ccctgagtga gcacagaggg 
acggagtctg 
55 2461 gccaaccctg ctgagacttc 

acactgaagg 

2521 cccgtgcatt cctctcccag 
ggccttggtc 



cccctaccct cactaccaat cccatccccc 
ccaaccccac ccccaaaccc cattcccatc 
ggctttgccc ctgcaatcaa cccacggaag 
ctgacgttca catgtacggc taagggaggg 
gggatgcaga ggaagggccc aggcctcctg 
atgccaggac agggggccca ctgtacccct 
cgagggaatc ctagggatgc agacccactt 
agtcaagggg aggaagaaga gggaggactg 
ccttgggctg ggggatcctg ggcacagtgg 
agggtgacag agagttgagg gctgtggtct 
ggaatcccag gatctgccgg acccaaggtg 
cggcccagaa agaagggatg ccacagagtc 
acctgatcag ggatggccct aagtgacaat 
aaccctcagg gagataaggt gttggtgtaa 
gggggttgag aaagggcagt ccctggcagg 
cataacgttc accctagaac caaaggggtc 
gatgtggccc ctcctcactt gtctttccag 
aaggtgactc aggtcaacac aggggcccca 
ctgccaagca tccaggtgga gagcctgagg 
cagcaagggg gccccataga aatctgccct 
agggctgtca gctgaagtcc ctccattatc 
ccttggtctg aaggggctgg agtcaggtca 
gtggacgtga ggaccaagcg gactcgtcac 
catctctcgt tgtccttcgc gggaggacct 
atatccttct gtaccatatc agggatgtga 
aaaagggtgg gattaggccc tacaaggaga 
gaccctccac ccaagtagag tggggacctc 
tgggaatccg tggctgtgct tgcagtctgc 
gaatcaggag ctccaggaac caggcagtga 
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2581 tgagtcagtg 
cactgaaggt 

2641 ttgcctggaa 
ccagagggcc 

2701 tggcctcacc 
ggctgtaccc 

2761 tgaggtgccc 
aagtaggacc 

2821 cgaggcactg 
cagagcctcc 

2881 aaggttcagt 
ccaggcctgt 

2941 gggtcttcat 
agagtcatca 

3001 tgcctcttga 
gcccgaggag 

3061 aggccctggg 
accgcttctt 

3121 cctcttctac 
tcaccgagtc 

3181 ctccccacag 
actctttgga 

3241 gacaatccga 
tttcccgacc 

3301 tggagtccga 
cattttctgc 

3361 tcctcaagta 
agtgtcctca 

3421 gaaattgcca 
ttgcagctgg 

3481 tctttggcat 
cttgtcacct 

3541 gcctgggcct 
aagacaggcc 

3 601 tcctgataat 
gaggagaaaa 

3661 t c tgggagga 
gtcttcgcac 

3721 atcccaggaa 
taccggcagg 

3781 tgcccggcag 
ctcattgaaa 

3841 ccagctatgt 
cacatttcct 

3901 acccacccct 
gcacatgttg 

3961 cagccagggc 
catccattag 

4021 cttccactgc 
gcagtcagca 

4081 ttcttagcag 
ctttcctgtt 

4141 ggaattgttc 
ccaagtttat 

4201 gaatgacag t 
tcctgttttt 

4261 tattcagatt 
gcagtggaat 



tcctcaggtc acagagcaga 
tgcacaccaa gggccccacc 
ctccctattc tcagtcctgc 
tcccacttcc tccttcaggt 
gaggagcatt gaaggagaag 
tcagttctca cctaaggcct 
tgcccagctc ctgcccgcac 
gcagaggagt cagcactgca 
cctggtgggt gcgcaggctc 
tctagtggaa gttaccctgg 
tcctcaggga gcctccagct 
tgagggctcc agcaaccaag 
gttccaagca gcaatcagta 
tcgagccagg gagccggtca 
ggacttcttt cccgtgatct 
cgaggtggtg gaagtggtcc 
ctcctacgat ggcctgctgg 
cgtcctggcc ataatcgcaa 
gctgagtatg ttggaggtgt 
gctgctcatg caagatctgg 
tgatcctgca tgctacgagt 
gaaagtcctg caccatacac 
gcatgaacgg gctttgagag 
cagtgggagg gggtctgggc 
ctcgtgtgat atgaggccca 
tgagtttctg ttctgttgga 
aaatgttcct tttaacaaat 
agtcacacat agtgctgttt 
gggaaatcca ttccattttg 



ggggacgcag acagtgccaa 
cgcccagaac aaatgggact 
agcctgagca tgtgctggcc 
t c t gaggggg acaggc tgac 
atctgtaagt aagcctttgt 
cacacacgct ccttctctcc 
tcctgcctgc tgccctgacc 
agcctgaaga aggccttgag 
ctgctactga ggagcagcag 
gggaggtgcc tgctgccgac 
tctcgactac catcaactac 
aagaggaggg gccaagaatg 
ggaaga tgg t tgag t tgg 1 1 
caaaggcaga aatgctggag 
tcagcaaagc ctccgagtac 
ccatcagcca cttgtacatc 
gcgacaatca ggtcatgccc 
tagagggcga ctgtgcccct 
ttgaggggag ggaggacagt 
tgcaggaaaa ctacctggag 
tcctgtgggg tccaagggcc 
taaagatcgg tggagaacct 
a^gggagELaga g tgagt c t ca 
cagtgcacct tccagggccc 
ttcctgcctc tttgaagaga 
tgactttgag atttatcttt 
ggttggatga acttcagcat 
atatagttta ggggtaagag 
tgagttgtca cataataaca 
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4321 atgtatttgc ctatattgtg aacgaattag cagtaaaata catgatacaa 
ggaactcaaa 

4381 agatagttaa ttcttgcctt atacctcagt ctattatgta aaattaaaaa 
tatgtgtatg 

4441 tttttgcttc tttgagaatg caaaagaaat taaatctgaa taaattcttc 
ctgttcactg 

4501 gctcatttct ttaccattca ctcagcatct gctctgtgga aggccctggt 
agtagtggg 
// 

U03735. Human MAGE -3 anti ... [gi : 468825] 

LOCUS HSU03 735 4204 bp DNA PRJ 

07 -APR- 19 94 

DEFINITION Human MAGE-3 antigen (MAGE-3) gene, complete cds. 
ACCESSION U03735 

VERSION U03735,l GI: 468825 

KEYWORDS 

SOURCE human. 
ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chorda ta; Craniata; 
Vertebrata; Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; 

Hominidae ; Homo . 

REFERENCE 1 (bases 1 to 4204) 

AUTHORS Gaugler , B . , Van den Eynde , B . , van der Bruggen , P . , 
Romero, P. , 

Gaforio, J. J, , De Plaen^E., hethe,B., Brasseur^F. 

and Boon,T. 

TITLE Human gene MAGE-3 codes for an antigen recognized on 

a melanoma by 

autologous cytolytic T lyrrphocytes 
JOURNAL J. Exp. Med. 179, 921-930 (1994) 
MEDLINE 94157413 
REFERENCE 2 (bases 1 to 4204) 
AUTHORS Gaugl er , B . 
TITLE Direct Submission 

JOURNAL Submitted (25-NOV-1993) Beatrice Gaugler, Ludwig 

Institute for 

Cancer Research, 74 Avenue Hippocrate, Brussels 

1200, Belgium 

FEATURES Location/Qualifiers 
source i. .4204 

/organisra=**Homo sapiens'^ 

/isolate= "patient MZ2" 

/db_xref="taxon: 9606" 

/chromosome= 

/sex='*f emale*' 

/eel l_type='' lymphocyte" 

/tissue_type= "blood" 

/devest age« ^adulf 
exon 434 . .468 

/number=l 
exon 2254.. 2319 

/number=2 
exon 2400.. 3978 

/number=:3 
gene 2465.. 3409 

/gene=^\viAGE-3" 
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CDS 2465.. 3409 

/genes'^MAGE-S" 
/ codon_s t art=l 
/products '^MAGE- 3 antigen" 
/protein_id=^*AAA17446 . 1" 
/db_xref ='»GI : 46882S" 



/translations "MPLEQRSQHCKPEEGLEARGEALGLVGAQAPATEEQEAASSSSTLVEVTLGE 

VPAAESPDPPQSPQGASSLPTTimPLWSQSyEDSSNQEEEGPSTFPDLESEPQAALSRKVAELVH 

FLLLKYRAREPVTKAEMLGSWGmQYFFPVIPSKASSSLQLVFGIEIiME^ 

SYDGLLGDNQIMPKAGLLIIVIiAIIAREGDCAPEEKIWEELSVLEVPEGREDSILGDPK^ 

VQENYLBYRQVPGSDPACYEFLWGPRALVETSYVKVIJfflMVKISGGPHI 

polyA_signal 3958 . .3963 

polyA_site 3979 
BASE COUNT 944 a 1144 c 1223 g 893 t 

ORIGIN 

1 acgcaggcag tgatgtcacc cagaccacac cccttccccc aatgccactt 
cagggggtac 

61 tcagagtcag agacttggtc tgaggggagc agaagcaatc tgcagaggat 
ggcggtccag 

121 gctcagccag gcatcaactt caggaccctg agggatgacc gaaqqccccq 
cccacccacc 

181 cccaactccc ccgaccccac caggatctac agcctcagga cccccqtccc 
•aatccttacc 

241 ccttgcccca tcaccatctt catgcttacc tccaccccca tccgatcccc 
atccaggcag 

301 aatccagttc cacccctgcc cggaacccag ggtagtaccg ttgccaqaat 
gtgacgccac ^ ^ ^ts 

361 tgacttgcgc attggaggtc agaagaccgc gagattctcg ccctgagcaa 
cgagcgacgg 

421 cctgacgtcg gcggagggaa gccggcccag gctcggtgag gaggcaaggt 
aagacgctga 

481 gggaggactg aggcgggcct cacctcagac agagggcctc aaataatcca 
gtgctgcctc 

541 tgctgccggg cctgggccac cccgcagggg aagacttcca ggctgggtcg 
ccactacctc ^ 

601 accccgccga cccccgccgc tttagccacg gggaactctg gggacagagc 
ttaatgtggc 

661 cagggcaggg ctggttagaa gaggtcaggg cccacgctgt ggcaggaatc 
aaggtcagga 3333 

721 Gcccgagagg gaactgaggg cagcctaacc accaccctca ccaccattcc 
cgtcccccaa 

781 cacGcaaccc cacccccatc ccccattccc atccccaccc ccacccctat 
cctggcagaa 

841 tccgggcttt gcccctggta tcaagtcacg gaagctccgg gaatqqcqqc 

caggcacgtg ^ ^ 33 33 

901 agtcctgagg ttcacatcta cggctaaggg agggaagggg ttcggtatcq 
cgagtatggc ^3 

961 cgttgggagg cagcgaaagg gcccaggcct cctggaagac agtgqaqtcc 
tgaggggacc ^ 

ccttttcatt ^^^^^^^^^^ ggacaggggg cccactgtac ccctgtctca aaccgaggca 

cccagccctg aatcctaggg atgcagaccc acttcagcag ggggttgggg 

ccagatcagt tggggaggaa gaagagggag gactgagggg accttggagt 

gctcattgcg Sr^ctggggga tgctgggcac agtggccaaa tgtgctctgt 
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1261 ccttcagggt gaccagagag 
caggtcagca 

1321 gagggaggaa tcccaggatc 
ccctatgtgg 
5 1381 tggacagatg cagtggtcct 

tgagggagga 

1441 ttgagggtac ccctgggaca 
gccctgctcc 

1501 tgctgttacc tcagagagcc 
10 ttatcctagg 

1561 atcactgatg tcagggaagg 
gggcagtaga 

1621 gggaggctct cagaccctac 
ctcacccagg 
15 1681 gtacatggac ttcaataaat 

gacctgggaa 

1741 tgtatggcca gatgtgggtc 
tgtgagttct 

1801 tgacatgaga gattctcagg 
20 ggagaaaggt 

1861 gagggccctg agtgagcaca 
acctcacaga 

1921 gtctggccaa ccctcctgac 
tctgcacatt 
25 1981 gggggcccgt ggattcctct 

agtgaggact 

2041 tggtctgagg cagtgtcctc 
gccaacggtg 

2101 aaggtttgcc ttggattcaa 
30 ggactccaga 

2161 gcgcctggcc tcaccctcaa 
tggccggatg 

2221 taccctgagg tgccctctca 
tgacctggag 
35 2281 gaccagaggc ccccggagga 

tttgttagag 

2341 cctccaaggt tccattcagt 
tctccccagg 

2401 ccagtgggtc tccattgccc 
40 tgaccagagt 

2461 catcatgcct cttgagcaga 
ttgaggcccg 

2521 aggagaggc c c tgggc c t gg 
aggaggctgc 
45 2581 ctcctcctct tctactctag 

ccgagtcacc 

2641 agatcctccc cagagtcctc 
actaccctct 

2701 ctggagccaa tcctatgagg 
50 gcaccttccc 

2761 tgacctggag tccgagttcc 
tggttcattt 

2821 tctgctcctc aagtatcgag 
tggggagtgt 
55 2881 cgtcggaaat tggcagtatt 

gttccttgca 

2941 gctggtcttt ggcatcgagc 
acatctttgc 
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ttgagggctg 


tggtctgaag 


agtgggactt 


tgcagggccc 


aaggtgtacc 


cccaaggggc 


aggatctgcc 


aagcatccag 


gtgaagagac 


gaatgcggac 


tgggggcccc 


ataaaaatct 


tgggcagggc 


tgtcagctga 


ggtccctcca 


ggaagccttg 


gtctgagggg 


gctgcactca 


taggagtgga 


ggtgaggacc 


aagcagtctc 


ttggacatct 


ctcgttgtcc 


tttccgggag 


ccctcatgtt 


tttctgtacc 


atatcaggta 


ccagcagaag 


ggagggatta 


ggccctataa 


gaggggatcc 


tccaccccag 


tagagtgggg 


agttctggga 


atccgtggct 


gcgtttgctg 


cccaggaatc 


aggagctcca 


ggaacaaggc 


aggtcacaga 


gtagaggggg 


ctcagatagt 


accaagggcc 


ccacctgccc 


cagaacacat 


tactttcagt 


cctgcagcct 


cagcatgcgc 


cttcctcctt 


caggttctga 


ggggacaggc 


gcactgaagg 


agaagatctg 


taagtaagcc 


actcagctga 


ggtctctcac 


atgctccctc 


agctcctgcc 


cacactcccg 


cctgttgccc 


ggagtcagca 


ctgcaagcct 


gaagaaggcc 


tgggtgcgca 


ggctcctgct 


actgaggagc 


ttgaagtcac 


cctgggggag 


gtgcctgctg 


agggagcctc 


cagcctcccc 


actaccatga 


actccagcaa 


ccaagaagag 


gaggggccaa 


aagcagcact 


cagtaggaag 


gtggccgagt 


ccagggagcc 


ggtcacaaag 


gcagaaatgc 


tctttcctgt 


gatcttcagc 


aaagcttcca 


tgatggaagt 


ggaccccatc 


ggccacttgt 
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3001 cacctgcctg ggcctctcct acgatggcct gctgggtgac aatcagatca 
tgcccaaggc 

3061 aggcctcctg ataatcgtcc tggccataat cgcaagagag ggcgactgtg 
cccctgagga 

5 3121 gaaaatctgg gaggagctga gtgtgttaga ggtgtttgag gggagggaag 

acagtatctt 

3181 gggggatccc aagaagctgc tcacccaaca tttcgtgcag gaaaactacc 
tggagtaccg 

3241 gcaggtcccc ggcagtgatc ctgcatgtta tgaattcctg tggggtccaa 
10 gggccctcgt 

3301 tgaaaccagc tatgtgaaag tcctgcacca tatggtaaag atcagtggag 

gacctcacat 

3361 ttcctaccca cccctgcatg agtgggtttt gagagagggg gaagagtgag 

tctgagcacg 

15 3421 agttgcagcc agggccagtg ggagggggtc tgggccagtg caccttccgg 

ggccgcatcc 

3481 cttagtttcc actgcctcct gtgacgtgag gcccattctt cactctttga 
agcgagcagt 

3541 cagcattctt agtagtgggt ttctgttctg ttggatgact ttgagattat 
20 tctttgtttc 

3601 ctgttggagt tgttcaaatg ttccttttaa cggatggttg aatgagcgtc 
agcatccagg 

3661 tttatgaatg acagtagtca cacatagtgc tgtttatata gtttaggagt 
aagagtcttg 

25 3721 ttttttactc aaattgggaa atccattcca ttttgtgaat tgtgacataa 

taatagcagt 

3781 ggtaaaagta tttgcttaaa attgtgagcg aattagcaat aacatacatg 
agataactca 

3841 agaaatcaaa agatagttga ttcttgcctt gtacctcaat ctattctgta 
30 aaattaaaca 

3901 aatatgcaaa ccaggatttc cttgacttct ttgagaatgc aagcgaaatt 
aaatctgaat 

3961 aaataattct tcctcttcac tggctcgttt cttttccgtt cactcagcat 

ctgctctgtg 

35 4021 ggaggccctg ggttagtagt ggggatgcta aggtaagcca gactcacgcc 

tacccatagg 

4081 gctgtagagc ctaggacctg cagtcatata attaaggtgg tgagaagtcc 

tgtaagatgt 

4141 agaggaaatg taagagaggg gtgagggtgt ggcgctccgg gtgagagtag 
40 tggagtgtca 

4201 gtgc 

// 

AF043498. Homo sapiens pros ... [gi : 2909843] 
45 LOCUS AF043498 990 bp mRKA PRI 24- 

FEB-1998 

DEFINITION Homo sapiens prostate stem cell antigen (PSCA) 
mRNA, complete cds . 
ACCESSION AF043498 
50 VERSION AF043498.1 GI: 2909843 

KEYWORDS 

SOURCE human. 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
55 Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; 

Homo. 

REFERENCE 1 (bases 1 to 990) 
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AUTHORS Reiter,R«E., Gu,Z., Watabe,T. , Thomas, G., Kinga,S., 
Davis ^ E . , 

Wahl,M., Nisitani,S., Yaraashiro, J, , Le Beau,M.M., 

Losa,M. and 
5 Witte,O.N. 

TITLE Prostate stem cell antigen: a cell surface marker 

overexpressed in 

prostate cancer 

JOURNAL Proc. Natl. Acad. Sci. U.S.A. 95 (4), 1735-1740 (1998) 
10 MEDLINE 98132661 

REFERENCE 2 (bases 1 to 990) 
AUTHORS Reiter^R.E. 
TITLE Direct Siibmission 

JOURNAL Submitted (19-JAN-1998) Urology, UCLA, 66-134 CHS 
15 10833 Le Conte 

Ave., Los Angeles, OA 90095, USA 
FEATURES Location/Qualifiers 
source 1. .990 

/organism= *Homo sapiens" 
20 /db_xref =*taxon: 9606" 

/ chr omos ome = " 8 " 
/map=*^8q24.2" 

/note=''LAPC-4 prostate cancer xenograft" 
gene 1. .990 

25 /gene^^PSCA" 
CDS 18.. 389 

/gene-"PSCA" 

/note=''GPI-anchored cell surface protein" 
/codon_start=l 

30 /product=**prostate stem cell antigen" 

/protein_id="AAC39607 . 1" 
/db_xref=^^GI: 2 909844" 

/translation=^^MKAVLLALLMAGLALQPGTALLCYSCKAQVSNEDCLQVENCTQLGEQCW 
35 TARIRAVGLLTV^ISKGCSLNCVDDSQDYYVGKKNITCCDTDLCNASGAHALQPAAAILALLPA 
LGLLLWGPGQL" 



40 



45 



50 



55 



BASE COUNT 


193 a 


299 c 


285 g 202 t 11 


others 


ORIGIN 












1 


agggagaggc 


agtgaccatg 


aaggctgtgc 


tgcttgccct 


gttgatggca 


ggcttggccc 












61 


tgcagccagg 


cactgccctg 


ctgtgctact 


cctgcaaagc 


ccaggtgagc 


aacgaggact 












121 


gcctgcaggt 


ggagaactgc 


acccagctgg 


gggagcagtg 


ctggaccgcg 


cgcatccgcg 












181 


cagttggcct 


cctgaccgtc 


atcagcaaag 


gctgcagctt 


gaactgcgtg 


gatgactcac 












241 


aggactacta 


cgtgggcaag 


aagaacatca 


cgtgctgtga 


caccgacttg 


tgcaacgcca 












301 


gcggggccca 


tgccctgcag 


ccggctgccg 


ccatccttgc 


gctgctccct 


gcactcggcc 












361 


tgctgctctg 


gggacccggc 


cagctatagg 


ctctgggggg 


ccccgctgca 


gcccacactg 












421 


ggtgtggtgc 


cccaggcctt 


tgtgccactc 


ctcacagaac 


ctggcccagt 


gggagcctgt 












481 


cctggttcct 


gaggcacatc 


ctaacgcaag 


tttgaccatg 


tatgtttgca 


ccccttttcc 












541 


ccnaaccctg 


accttcccat 


gggccttttc 


caggattccn 


accnggcaga 



tcagttttag 
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601 tganacanat ccgcntgcag atggcccctc caaccntttn 
tccatggccc 

661 agcattttcc acccttaacc ctgtgttcag gcacttnttc 
ccttccctgc 

5 721 ccaccccatt tatgaattga gccaggtttg gtccgtggtg 

ccagcagggg 

781 acaggcaatc aggagggccc agtaaaggct gagatgaagt 

gaactggagg 

841 acaagagttg acgtgagttc ctgggagttt ccagagatgg 
10 cctggaggaa 

901 ggggccaggc ctcacatttg tggggntccc gaatggcagc 
cgtaggccct 

961 taataaacac ctgttggata agccaaaaaa 

// 

15 

P06870. GLANDULAR KALLIKR. [gi: 125170] 
LOCUS KLK1_HUMAN 262 aa PRI 20- 

AUG-2001 

DEFINITION GLANDULAR KALLIKREIN 1 PRECURSOR (TISSUE 

20 KALLIKREIN) 

(KIDNEY/PANCREAS/SALIVARY GLAND KALLIKREIN) . 
ACCESSION P06870 
PID gl25170 
VERSION P06870 GI: 125170 

25 DBSOURCE swissprot: locus KLK1_HUMAN, accession P06870; 

class : s tandar d . 

extra accessions :Q9UMJ1, created: Jan 1, 1988. 
sequence updated: Jan 1, 1988. 
annotation updated: Aug 20, 2001. 
30 xrefs: gi : gi : 186652/ gi : gi: 186653, gi: gi : 186649, 

gi: gi: 

186651, gi: gi : 186645, gi : gi : 186646, gi : gi : 
186647, gi: gi: 

186648, gi: gi: 34026, gi : gi : 34027, gi : gi : 186643, 

35 . gi : gi : 

386843, gi: gi : 67558 

xrefs (non-sequence, databases) : HSSP P00757, MEROPS 

SOI. 160, 

GlycoSuiteDB P06870, MIM 147910, InterPro IPR001314, 

40 InterPro 

IPR001254, Pfam PF00089, PRINTS PR00722, PROSITE 
PS50240, PROSITE 

PS00134, PROSITE PS00135 
KEYWORDS Hydrolase; Serine protease; Glycoprotein; Multigene 

45 family; 

Zymogen; Signal . 
SOURCE human. 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
50 Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Horainidae; 

* Homo . 

REFERENCE 1 (residues 1 to 262) 

AUTHORS Fukushima , D , , Ki t amura , N . and Nakanishi , S . 
55 TITLE Nucleotide sequence of cloned cDNA for hiiman 

pancreatic kallikrein 

JOURNAL Biochemistry 24, 8037-8043 (1985) 
REMARK SEQUENCE FROM N.A. 



tgttgntgtt 
ccccaggaag 
tcccccgcac 
ggactgagta 
ggcctggagg 
ctgagcacag 
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T I S SUE =Pancr eas 
REFERENCE 2 (residues 1 to 2 62) 

AUTHORS Evans, B. A., Yiin^Z.X., Close, J. A., Tregear,G.W. , 

Kitaimira/N. , 

Nakanishi^S. , Callen.D.F., Baker, E., Hyland, V. J. , 
Sutherland^G.R. 

and Richards , R . I . 
TITLE Structure and chromosoraal localization of the human 

renal 

kallikrein gene 

Biochemistry. 27 (9), 3124-3129 (1988) 
88269498 
2898948 

SEQUENCE ETIGM N.,A. 
TISSUE=Kidney 

3 (residues l to 262) 

Angermann^A. , Bergmann,C, and Appelhans,H. 

Cloning and expression of human salivary-gland 
kallikrein in 

Escherichia coli 

The Biochemical journal. 262 (3), 787-793 (1989) 

90073574 
2686621 

SEQUENCE FROM N.A. 
TISSUE=Salivary gland 

4 (residues 1 to 262) 
Baker, A. R. and Shine, J. 

Human kidney kalli3crein: cDNA cloning and sequence 



JOURNAL 
MEDLINE 
PUBMED 
REMARK 

REFERENCE 
AUTHORS 
TITLE 



JOURNAL 
MEDLINE 
PUBMED 
REMARK 



REFERENCE 

AUTHORS 

TITLE 
analysis 

JOURNAL 

MEDLINE 
PUBMED 

REMARK 

REFERENCE 
AUTHORS 
TITLE 

and sites of 

JOURNAL 
33 (4), 

MEDLINE 
PUBMED 
REMARK 



DNA (Mary Ann Liebert, Inc.) 4 (6), 445-450 (1985) 
86135264 
3853975 

SEQUENCE OF 17-262 FROM N.A. 

TISSXJrE=Kidney 
5 (residues 1 to 2 62) 

Lu,H,S., Lin,F.K., Chao,L. and Chao,J. 

Human urinary 3callikrein. Complete amino acid sequence 
glycosylation 

International journal of peptide and protein research. 



237-249 (1989) 
89326688 
2666327 

SEQUENCE OF 25-262. 
TISSUE=Urine 
6 (residues 1 to 262) 

Kellermann, J. , Lottspeich, F. , 



REFERENCE 

AUTHORS Kellermann, J. , Lottspeich, F. , Geiger,R, and 

Deut zraaiin,R. 

TITLE Human urinary kalli3crein — amino acid sequence and 

carbohydrate 

attachment sites 

JOURNAL Protein sequences & data analysis. 1 (3), 177-182 
(1988) 

MEDLINE 88203586 

PUBMED 3163150 
REMARK SEQUENCE OF 25-262, AND CARBOHYDRATE -LINKAGE SITES. 
TISSUE=Urine 
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REFERENCE 7 (residues 1 to 262) 

AUTHORS Lottspeich,F. , Geiger,R., Henschen,A, and Kut2bach,C. 

TITLE N-Terminal amino acid sequence of human urinary- 

kail ikrein homology 

v/ith other serine proteases 

Hoppe-Seyler's Zeitechrift fur physiologische Chemie. 



10 
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1947-1950 (1979) 

80114126 

393608 

SEQUENCE OF 25-55. 

TISSUE=Urine 

8 (residues 1 to 262) 

Takahashi , S . , Irie^A. , Katayaraa,Y., Ito,K. 



JOURNAL 
360 (12), 

MEDLINE 
PUBMED 
REMARK 

REFERENCE 

AUTHORS Takahashi , S . , Irie^A. , Katayaraa,Y., Ito,K. and 

Miyake , Y . 

TITLE N- terminal amino acid sequence of human urinary 

prokallikrein 

JOURNAL Journal of biochemistry. 99 (3), 989-992 (1986) 
MEDLINE 86223893 

PUBMED 3635530 
REMARK SEQUENCE OF 28-47. 
TISSUE=Urine 

(FUNCTION] GLANDULAR KALLIKREINS CLEAVE MET-LYS AND 
ARG-SER BONDS 

IN KININOGEN TO RELEASE LYS-BRADYKININ. 

[CATALYTIC ACTIVITY] PREFERENTIAL CLEAVAGE OF ARG- 1 - 

XAA BONDS IN 

SMALL MOLECULE SUBSTRATES. HIGHLY SELECTIVE ACTION TO 

RELEASE 

KALLIDIN (LYSYL-BRADYKININ) FROM KININOGSN INVOLVES 
HYDROLYSIS- OF 

MET- 1 -XAA OR LEU- 1 -XAA. 

[SIMILARITY] BELONGS TO PEPTIDASE FAMILY SI; ALSO 

KNOWN AS THE 

TRYPSIN FAMILY. KALLIKREIN SUBFAMILY, 



FEATURES 

source 



Protein 

Region 

Region 

Region 

Bond 

Bond 



Location/Qualifiers 
1. .262 

/ organi sm= *^Homo s api ens " 
/db_xref = *taxon: 9606" 
1. .262 
1. .262 

/product="*GLANDULAR KALLIKREIN 1 PRECURSOR" 
/EC^number = «3 . 4 . 2 1 . 3 5« 
1. .18 

/region_name=: '*Signal'' 
/not e= " PROBABLE . " 
19. .24 

/region_naTne= "Propeptide" 

/note= ^ACTIVATION PEPTIDE (PROBABLE)." 

25. .262 

/region_name=«Matiire chain" 
/note= ^GLANDULAR KALLIKREIN 1." 
bond(31,174) 
/bond_type=*disulfide" 
/note=*BY SIMILARITY." 
bond(50, 66) 

/bond_type = di sul fide" 
/note-*^BY SIMILARITY." 
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Site 



Site 



Site 



Site 



Site 



Site 



Region 



Bond 



Site 



Site 



Bond 



Region 



Bond 



Site 



65 

/si te_type= "active" 

/note= "CHARGE RELAY SYSTEM." 

93 

/site_type= "glycosylation" 

/note= '^O-LINKED . " 

102 

/si t e_t ype = ^^gly cosy 1 at ion" 
/note="N-LINKED (GLCNAC. . . ) . " 
104 

/site_type="glycosylation* 

/note= "0-LINKED . " 

108 

/sit e__t ype = "gly c o sy 1 a t i on" 
/note= *N-LINKED (GLCNAC 
120 

/site_type= "act ive" 
/note='»CHARGE RELAY SYSTEM." 
145 

/region_naine= '^Variant'' 

/note=*Q -> B. /FTId=:VAR_006625 

bond(153,220) 

/bond__type== "disulfide" 

/note-*BY SIMILARITY." 

165 

/sit€_type= ""glycosylation" 
/note="N-LINKED (GLCNAC. . . ) (PARTIAL) . " 
167 

/site_type= *glycosylation" 
/note==«0 -LINKED . " 
bond(185,199) 
/bond_type= ^disulf ide" 
/note='^BY SIMILARITY." 
186 

/region_name= Variant" 

/note="E -> K. /FTId=VAR_006626." 

bond(210,235) 

/bond_type= "disulfide" 

/note="BY SIMILARITY." 

214 

/site_type= "active" 

/note= "CHARGE RELAY SYSTEM." 



ORIGIN 

1 mwflvlclal slggtgaapp iqsrivggwe ceqhaqpwqa alyhfstfqc 

ggilvhrqwv 

61 Itaahcisdn yqlwlgrhnl fddentaqfv hvsesfphpg fnmsllenht 

rqadedyshd 

121 Imllrltepa dtitdavkw elptqepevg stclasgwgs iepenfsfpd 
dlqcvdlkil 

181 pndecekahv qkvtdfmlcv ghleggkdtc vgdsggplmc dgvlc^rvtsw 
gyvpcgtpnk 

241 psvavrvlsy vlcwiedtiae ns 

// 



55 



P08217. ELASTASE 2A PRECU. [gi : 119255] 
LOCUS EL2A__HUMAN 269 aa 

AUG-2001 



PRI 



20- 
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DEFINITION 

ACCESSION 

PID 

VERSION 
DBSOURCE 



ELASTASE 2A PRECURSOR. 

P08217 

gll9255 

P08217 GI: 119255 

swissprot: locus Eri2A_HUMAN, accession P08217; 
class: standard, 
created: Aug 1, 1988. 
sequence updated: Aug 1, 1988. 
annotation updated: Aug 20, 2001. 

xrefs: gi: gi : 182022, gi: gi : 182023, gi : gi : 182057, 



gi: gi: 



MEROPS 
PRINTS 



SOI. 155, 
PR00722, 



182058, gi: gi : 88298, gi : gi : 88299 
xrefs (non-sequence databases) : 
InterPro IPR001314, 

InterPro IPR001254, Pfara PF00089, 
PROSITE PS50240, 

PROSITE PS00134, PROSITE PS00135 
KEYWORDS Hydrolase; Serine protease; Pancreas; Zymogen; Signal. 
SOURCE hiiman. 

ORGANISM Homo sapiens 

Eiikaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; 

Homo. 
REFERENCE 

AUTHORS 

TITLE 
elastase II 



1 (residues 1 to 269) 

Kawashiraa , I . , Tani , T . , Shimoda , K . and Takiguchi , Y . 
Characterization of pancreatic elastase II cDNAs: two 



mRNAs are expressed in human pancreas 
DNA 6 (2) , 163-172 (1987) 

87217962 

SEQUENCE FROM N.A. 
2 (residues 1 to 269) 

Fletcher, T, S. , Shen,W.F. and Largman,C. 

Primary structure of human pancreatic elastase 2 

determined by 

sequence analysis of the cloned mRNA 
Biochemistry 26 (23), 7256-7261 (1987) 
88107669 

SEQUENCE FROM N.A. 
[FUNCTION] ACTS UPON ELASTIN. 

[CATALYTIC ACTIVITY] PREFERENTIAL CLEAVAGE: LEU- 1 -XAA, 



JOURNAL 

MEDLINE 
REMARK 
REFERENCE 
AUTHORS 
TITLE 



JOURNAL 
MEDLINE 
REMARK 



MET- I -XAA 



AND PHE-I-XAA. HYDROLYSES ELASTIN, 
[SUBCELLULAR LOCATION] SECRETED. 
[TISSUE SPECIFICITY] PANCREAS. 
[SIMILARITY] BELONGS TO PEPTIDASE 



FAMILY SI; ALSO 



KNOWN AS THE 

FEATURES 

source 



TRYPSIN FAMILY, ELASTASE SUBFAMILY - 
Location/Qualifiers 
1. -269 

/ organi sm= '^Homo s ap i ens " 
/db_xref =«taxon: 9606" 

Region 1 . , 16 

/ region_name= Signal" 

Protein 1. .269 

/product = "ELASTASE 2 A PRECURSOR" 
/EC numbers "3. 4. 21. 71" 
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Region 



Region 



Bond 



Site 



Site 



Bond 



Bond 



Bond 



Site 



17. .28 

/region_name= Propeptide" 
/note- ^ACTIVATION PEPTIDE." 
29. .269 

/region_name=*^Mature chain" 
/note=:^ELASTASE 2A." 
bond (58, 74) 

/bond_type= '^disulf ide" 
/note««BY SIMILARITY." 
73 

/site_type=**active" 

/note=^CHARGE RELAY SYSTEM {BY SIMILARITY)." 
121 

/site^types'^active" 

/note=**CHARGE RELAY SYSTEM (BY SIMILARITY)." 

bond (155, 222) 

/bond_type= "disulfide" 

/note=^BY SIMILARITY." 

bond (186,202) 

/bond_t yp e = di sul fide " 

/note=«BY SIMILARITY." 

bond (212, 243) 

/bond_type= **disulf ide" 

/note=^*BY SIMILARITY." 

216 

/ s i t e_type = ^^a c t i ve " 

/note="CHARGE RELAY SYSTEM (BY SIMILARITY)." 



ORIGIN 

1 mirtlllstl vagalscgdp typpyvtrw ggeearpnsw pwqvslqyas 
ngkwyhtcgg 

61 slianswvlt aahcisssrt yrvglgrhnl yvaesgslav svskiwhJcd 
wnsnqiskgn 

121 diallklanp vsltdkiqla clppagtilp nnypcyvtgw grlqtngavp 
dvlqqgrllv 

181 vdyatcsssa vwgssvktsm icaggdgvis scngdsggpl ncqasdgrwq 
vhgivsfgsr 

241 Igcnyyhkps vftrvsnyid winsviann 

// 



NP_056933. pancreatic elasta. .. [gi: 7705648] 
LOCUS NP_056933 269 aa 

NOV-2000 

DEFINITION pancreatic elastase IIB [Homo sapiens] 
NP__056933 
g7705648 

NP_056933 .1 GI : 7705648 
REFSEQ: accession NM 015849.1 



PRI 



02- 



ACCESSION 
PID 

VERSION 
DBSOURCE 
KEYWORDS 
SOURCE 

ORGANISM 



human . 

Homo sapiens 
Eukaryota; Metazoa ; 
Euteleostomi ; 

Mammalia; Eutheria; Primates; 
Homo. 

REFERENCE 1 (residues 1 to 269) 

AUTHORS Kawashima , I . , Tani , T , , Shimoda , K . and Takiguchi , Y . 

TITLE Ciiaracterization of pancreatic elastase II cDNAs: two 

elastase II 



Cliordata; Craniata; Vertebrata; 

Catarrhini ; Hominidae ; 
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ttiRNAs are expressed in human pancreas 
JOURNAL DNA 6 (2), 163-172 (1987) 
MEDLINE 87217962 
COMMENT PROVISIONAL REFSEQ: This record has not yet been 

subject to final 

NCBI review. The reference sec[uence was derived from 

M16653.1. 

Location/Qualifiers 
1. .269 

/organism=*Homo sapiens' 
/db_xref="taxon:9606" 
/chromosome^ ^ 12 " 
/map=^^12" 
1. .269 

/products "pancreatic elastase I IB" 
1..16 

/note=**pancreatic elastase IIB signal 



FEATURES 

source 



Protein 
sig_peptide 



peptide" 

Region 



mat_peptide 



28. .262 

/region^names'^Trypsin-like serine protease" 
/db_xref =^CDD :Tryp_SPc" 
/note=''Tryp_SPc" 
29.-269 . 

/products ^pancreatic elastase IIB mature 
31, -262 

/region_narae= '"Trypsin" 
/db_xref = *CDD : pf amOO 0 8 9" 
/not e= taryps in" 
1. .269 

/gene='*IiOC51032" 
/db_xr e f = ^Locus ID : 5 1 0 3 2 " 
/coded_by=:^NM__015849.1:26. .835" 

1 mirtlllstl vagalscgvs tyapdmsrml ggeearpnsw pwqvslqyss 
ngqwyhtcgg 

61 slianswvlt aahcisssri yrvmlgghnl yvaesgslav svskiwhkd 
wnsnqvskgn 

121 diallklanp vsltdkiqla clppagtilp nnypcyvtgw grlqtngalp 
ddlkqgrllv 

181 vdyatcsssg wwgstvktnm icaggdgvic tcngdsggpl ncqasdgrwe 
vhgigsltsv 

241 Igcnyyykps iftrvsnynd winsviann 

// 



peptide" 

Region 



CDS 



ORIGIN 



FRAME 

LOCUS NM_006115 2148 bp mRNA PRI 19-JUN- 

2001 

DEFINITION Homo sapiens preferentially expressed antigen in 
melanoma (PRAME) , 
mRNA. 

ACCESSION NM_006il5 

VERSION NM 006115.1 GI: 5174640 

KEYWORDS 

SOURCE human, 

ORGANISM Homo sapiens 



-186- 



wo 2003/008537 



PCT/US2002/010189 



Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



Homo . 
REFERENCE 
AUTHORS 



1 (bases 1 to 2148) 

Ikeda,H., Lethe, B., Lehmann,F., VanBaren,N., 
Baurain, J. -F. , De 

Sraet.C, Charabost,H. , Vitale,M., Moretta,A. , Boon,T. and 
Coulie,P.G. 

TITLE Characterization of an antigen that is recognized on a 
melanoma 

showing partial HLA loss by CTL expressing an NK inhibitor 
receptor 

JOURNAL Irranimity 6 (2), 199-208 (1997) 

MEDLINE 97199265 
REFERENCE 2 (bases 1 to 2148) 

AUTHORS Williams JM, Chen GC, Zhu L and Rest RF, 

TITLE Using the yeast two-hybrid system to identify human 
epithelial cell 

proteins that bind gonococcal Opa proteins: intracellular 



bind pyruvate kinase via their Opa proteins and 
pyruvate for growth 

Mol, Microbiol. 27 (1), 171-186 (1998) 

98125741 

9466265 

3 (bases 1 to 2148) 



gonococci 

require host 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 

AUTHORS van Bar en, N., Chambost,H., Ferrant,A., Michaux,L., Ikeda,H 

Millard,!., Olive, D., Boon,T. and Coulie,P.G. 
TITLE FRAME, a gene encoding an antigen recognized on a 
human melanoma by 

cytolytic T cells, is expressed in acute leukaemia cells 
JOURNAL Br. J, Haematol. 102 (5), 1376-1379 (1998) 
MEDLINE 98423 996 
PUBMED 9753 074 
REFERENCE 4 (bases 1 to 2148) 

AUTHORS Dunham I, Shimizu N, Roe BA, Chissoe S, Hunt AR, Collins J] 
Bruskiewich R, Beare DM, Clamp M, Smink LJ, Ainscough 

R, Almeida 

JP, Babbage A, Bagguley C, Bailey J, Barlov/ K, Bates 
KN, Beasley O, 

Bird CP, Blakey S, Bridgeraan AM, Buck D, Burgess J, 

Burr ill WD, 

O'Brien KP and et al. 
TITLE The DNA sequence of human chromosome 22 
JOURNAL Nature 402 (6761), 489-495 (1999) 
MEDLINE 20057165 
PUBMED 10591208 
REMARK Erratum: C (published errattun appears in Nature 2000 Apr 

20;404(6780) :904]] 

REVIEWED REFSEQ: This record has been curated by NCBI 



COMMENT 
Staff. The 



amino acid 
a human 



reference sequence was derived from U65011.1. 
Summary: The protein encoded by this gene has a 509 

antigen, lacking a signal sequence, and recognized on 
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melanoma cell line by a T-lymphocyte clone. A 
significant level of 

this inRNA is detected in notinal testis as well as m 

many 

melanomas, non-small cell lung carcinomas, sarcomas, 

head and neck , 
tumors and renal carcinomas. The encoded protein is 

expressed . 

predominantly in acute leukemias carrying chromosomal 

abnormalities 

such as translocation t(8:21), which fuses the AMLl 

and ETO genes, ^ . . 

' Its e3cpression shares several characteristics with the 

expression ^ . , . n -. ^ 

patterns of MAGE, BAGE, and GAGE gene families, all of 

which are , 

expressed in tumors. This protein is expressed m a 

higher 

proportion of samples than genes of the MAGE, BAGE, 

and GAGE 

families, and of these four groups, only this protein 

is expressed 

by acute myeloid leukemias , 
COMPLETENESS: complete on the 3' end. 
FEATURES Location/Qualif iers 

source 1,.2148 

/organism="Homo sapiens" 
/db_xref="taxon:9606" 
/chromosome* "22" 
/map=*22qll.22'' 

/haplotype=*A24 A28 B13 B44 Cw6 Cw?" 
gene 1..2148 

/gene= "FRAME'' 

/note="MAPE; 01 P4" 

/ db_xr e f = " Lo cu s I D : 2 3 5 3 2 " 

/ db_xref = ^^MIM : 6 0 6 02 1" 
CDS 236.. 1765 

/gene=^^PRAME" 

/note=*'melanoma antigen preferentially 



eiqpressed in 



melanoma" 



tumors; Opa- interacting protein 0IP4" 
/ codon__s tart = 1 
/db_xref = **LocusID : 2 3 5 3 2 " 
/db_xref='WM: 606021" 

/product="preferentially expressed antigen of 

yprotein_id="NP_006106 . 1" 
/db_xref ='^GI : 5174641" 



/ trans la t ion= ^^MERRRLWGSIQSRYI SMSVWTSPRRLVELAGQSLLKDEAIiAIAALELLPREL 
FPPLFMAAFDGRHSQTLKAMVQAWPFTCLPLGVLMKGQHLHLETFKAVLDGLDVLI^ 
LQVIiDLRKNSHQDFWTWSGNRASLYSFPEPEAAQPMTKKRKVDGLSTEAEQPFIPVEVLTO 
EGAODELFSYLIEKVKRKKNVLRLCCKKLKIFAMPMQDIKMILKMVQLDSIEDLEVTCTW 
KFSPYLGQMINLRRLLLSHIHASSYISPEKEEQYIAQFTSQFLSLQCLQALYVDSLPFLRGRLDQL 
LRHVMNPLETLSITNCRLSEGDVMHLSQSPSVSQLSVLSLSGVMLTDVSPEPLQALLERASATLQD 
LVFDECGITDDQLLALLPSLSHCSQLTTLSFYGNSISISALQSLLQHLIGIiSKfLTHVLYPVPLESY 
EDIHGTLHLERLAYLHARLRELLCELGRPSMVWLSANPCPHCGDRTFYDPEPILCPCFMPN" 
variation 254 
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variation 



variation 



variation 



variation 



variation 



variation 



variation 



polyA_signal 
polyA_site 



/allele="C'' 
/alleles "T" 

/db_xref=^*cabSNP: 1129172" 
698 

/allele="C" 
/allele=^A'' 

/db^xref = ^dbSNP : 1063107" 

1001 

/allele=^^C" 
/allele=^^T" 

/db_xref=«dbSNP:2229695" 
1498 

/alleles '^T" 
/allele^^C 

/db_xref="dbSNP: 2229596" 
1762 

/allele="C" 
/allele="T" 

/db_xref=^dbSNP: 13604" 
1952 

/allele=^^A" 
/alleles "G" 
/db^xref =*^dbSNP : 7104" 

1984 

/allele=«A" 
/allele=^C" 
/db_xref ="clbSNP: 8405" 

2043 

/allele=^^T" 
/allele=«A" 

/cab_xref="dbSNP: 1804685" 
2108. .2113 
2130 



BASE COUNT" 534 a 548 c 558 g 508 t 

^^^^"^^ 1 gcttcagggt acagctcccc cgcagccaga agccgggcct gcagcccctc 
agcaccgcto^^^^^^^^^^ cacccgcttc ccaggcgtga cctgtcaaca gcaacttcgc 
^^^^'^?5?\ctctctgag gaaaaaccat tttgattatt actctcagac gtgcgtggca 
^^^^^''Sf gagacctaga aatccaagcg ttggaggtcc tgaggccagc ctaagtcgct 
^^^^^^gf ^egaaggcgt ttgtggggtt ccattcagag ccgatacatc agcatgagtg 
tgtggacaag^^^^^^^^^^ ^ttgtggagc tggcagggca gagcctgctg aaggatgagg 
ccctggccat ^^^^^^^^^^ gagttgctgc ccagggagct cttcccgcca ctcttcatgg 
cagcctttga^^^^^^^^^^ agccagaccc tgaaggcaat ggtgcaggcc tggcccttca 
cctgcctccc^^^^^^^^^^ ctgatgaagg gacaacatct tcacctggag accttcaaag 
ctgtgcttga^^^^^^^^^^ gtgctccttg cccaggaggt tcgccccagg aggtggaaac 
ttcaagtgct^^^^^^^^^^ aagaactctc atcaggactt ctggactgta tggtctggaa 
acagggccag 
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661 tctgtactca tttccagagc cagaagcagc tcagcccatg acaaagaagc 
gaaaagtaga 

721 tggtttgagc acagaggcag agcagccctt cattccagta gaggtgctcg 
tagacctgtt 

5 781 cctcaaggaa ggtgcctgtg atgaattgtt ctcctacctc attgagaaag 

tgaagcgaaa 

841 gaaaaatgta ctacgcctgt gctgtaagaa gctgaagatt tttgcaatgc 
ccatgcagga 

901 tatcaagatg atcctgaaaa tggtgcagct ggactctatt gaagatttgg 
10 aagtgacttg 

961 tacctggaag ctacccacct tggcgaaatt ttctccttac ctgggccaga 

tgattaatct 

1021 gcgtagactc ctcctctccc acatccatgc atcttcctac atttccccgg 
agaaggaaga 

15 1081 gcagtatatc gcccagttca cctctcagtt cctcagtctg cagtgcctgc 

aggctctcta 

1141 tgtggactct ttatttttcc ttagaggccg cctggatcag ttgctcaggc 
acgtgatgaa 

1201 ccccttggaa accctctcaa taactaactg ccggctttcg gaaggggatg 
20 tgatgcatct 

1261 gtcccagagt cccagcgtca gtcagctaag tgtcctgagt ctaagtgggg 
tcatgctgac 

1321 cgatgtaagt cccgagcccc tccaagctct gctggagaga gcctctgcca 
ccctccagga 

25 1381 cctggtcttt gatgagtgtg ggatcacgga tgatcagctc cttgccctcc 

tgccttccct 

1441 gagccactgc tcccagctta caaccttaag cttctacggg aattccatct 
ccatatctgc 

1501 cttgcagagt ctcctgcagc acctcatcgg gctgagcaat ctgacccacg 
30 tgctgtatcc 

1561 tgtccccctg gagagttatg aggacatcca tggtaccctc cacctggaga 

ggcttgccta 

1621 tctgcatgcc aggctcaggg agttgctgtg tgagttgggg cggcccagca 

tggtctggct 

35 1681 tagtgccaac ccctgtcctc actgtgggga cagaaccttc tatgacccgg 

agcccatcct 

1741 gtgcccctgt ttcatgccta actagctggg tgcacatatc aaatgcttca 

ttctgcatac 

1801 ttggacacta aagccaggat gtgcatgcat cttgaagcaa caaagcagcc 
40 acagtttcag 

1861 acaaatgttc agtgtgagtg aggaaaacat gttcagtgag gaaaaaacat 
tcagacaaat 

1921 gttcagtgag gaaaaaaagg ggaagttggg gataggcaga tgttgacttg 
aggagttaat 

45 1981 gtgatctttg gggagataca tcttatagag ttagaaatag aatctgaatt 

tctaaaggga 

2041 gattctggct tgggaagtac atgtaggagt taatccctgt gtagactgtt 
gtaaagaaac 

2101 tgttgaaaat aaagagaagc aatgtgaagc aaaaaaaaaa aaaaaaaa 
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ED-B domain of Fibronectin 



LOCUS HSFIBEDB 2823 bp DNA linear 

PRI 09 -AUG- 1999 
5 DEFINITION H;iman fibronectin gene ED-B region. 

ACCESSION X07717 
VERSION X07717.1 GI: 31406 

KEYWORDS alternate splicing; fibronectin. 
SOURCE hiiman. 
10 ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 

Euteleostomi; 

Matranalia; Eutheria; Primates; Catarrhini; Hotninidae; 

Homo. 

15 REFERENCE 1 (bases 1 to 2823) 

AUTHORS Paolella.G., Henchclif f e,C. , Sebastio^G. and 

Baralle^F.E. 

TITLE Sequence analysis and in vivo expression show that 
alternative 

20 splicing of ED-B and ED-A regions of the human 

fibronectin gene are 

independent events 
JOURNAL Nucleic Acids Res. 16 (8), 3545-3557 (1988) 
MEDLINE 88233940 
25 FEATURES Location/Qualifiers 

source 1..2823 

/organism="Homo sapiens" 
/db_xref = " taxon : 96 06 " 
/clone=:"MA10" 
30 exon 1 . . 104 

/numbers 1 

/products " fibronectin" 
CDS join{<2. . 104 , 1375 1647 , 2758 . .>2823) 

/ c odon_s t art = 1 
35 /product =" fibronectin" 

/protein_id= "CAB52437 . 1" 
/db_xref ="GI : 5725425 " 

/translations " CTFDNLS PGLEYNVSVYTVKDDKESVPISDTI I PEVPQLTDLSF 
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VDITDSSIGLRWTPLNSSTIIGYRITWAAGEGIPIFEDFVDSSVGYYTVTGLEPGID 

YDISVITLINGGESAPTTLTQQTAVPPPTDLRFTNIGPDTMRVTW" 
5 intron 105.. 13 74 

/nun±>er=l 
exon 1375.. 1647 

/note=''ED-B exon" 
/nuTnber=2 

I Q /products " f ibronect in " 

intron 1648.. 2757 

/nuinber=2 
exon 2758. .2823 

/number =3 

][5 /product^ " f ibronect in" 

BASE COUNT 824 a 556 C 528 g 915 t 

ORIGIN 

1 ctgcactttt gataacctga gtcccggcct ggagtacaat gtcagtgttt 
acactgtcaa 

20 61 ggatgacaag gaaagtgtcc ctatctctga taccatcatc ccaggtaata 

gaaaataagc 

121 tgctatcctg agagtgacat tccaataaga gtggggatta gcatcttaat 
ccccagatgc 

181 ttaagggtgt caactatatt tgggatttaa ttccgatctc ccagctgcac 
25 tttccaaaac 

241 caagaagtca aagcagcgat ttggacaaaa tgcttgctgt taacactgct 

ttactgtctg 

3 01 tgcttcactg ggatgctgtg tgttgcagcg agtatgtaat ggagtggcag 
ccatggcttt 

30 361 aactctgtat tgtctgctca catggaagta tgactaaaac actgtcacgt 

gtctgtactc 

421 agtactgata ggctcaaagt aatatggtaa atgcatccca tcagtacatt 
tctgcccgat 

481 tttacaatcc atatcaattt ccaacagctg cctatttcat cttgcagttt 
35 caaatccttc 

541 tttttgaaaa ttggatttta aaaaaaagtt aagtaaaagt cacaccttca 

gggttgttct 

601 ttcttgtggc cttgaaagac aacattgcaa aggcctgtcc taaggatagg 
cttgtttgtc 
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661 cattgggtta taacataatg aaagcattgg acagatcgtg tccccctttg 
gactcttcag 

721 tagaatgctt ttactaacgc taattacatg ttttgattat gaatgaacct 
aaaatagtgg 

781 caatggcctt aacctaggcc tgtctttcct cagcctgaat gtgcttttga 
atggcacatt 

841 tcacaccafca cattcataat gcattagcgt tatggccatg atgttgtcat 
gagttttgta 

901 tgggagaaaa aaaatcaatt tatcacccat ttattatttt ttccggttgt 
tcatgcaagc 

961 ttattttcta ctaaaacagt tttggaatta ttaaaagcat tgctgatact 
tacttcagat 

1021 attatgtcta ggctctaaga atggtttcga catcctaaac agccatatga 
tttttaggaa 

1081 tctgaacagt tcaaattgta ccctttaagg atgttttcaa aatgtaaaaa 
atatatatat 

1141 atatatatat tccctaaaag aatattcctg tttattcttc tagggaagca 
aactgttcat 

1201 gatgcttagg aagtcttttc agagaattta aaacagattg catattacca 
tcattgcttt 

1261 aacattccac caattttact actagtaacc tgatatacac tgctttattt 
tttcctcttt 

1321 ttttccctct attttccttt tgcctccccc tccctttgct ttgtaactca 
atagaggtgc 

1381 cccaactcac tgacctaagc tttgttgata taaccgattc aagcatcggc 
ctgaggtgga 

1441 ccccgctaaa ctcttccacc attattgggt accgcatcac agtagttgcg 
gcaggagaag 

1501 gtatccctat ttttgaagat tttgtggact cctcagtagg atactacaca 
gtcacagggc 

1561 tggagccggg cattgactat gatatcagcg ttatcactct cattaatggc 
ggcgagagtg 

1621 cccctactac actgacacaa caaacgggtg aattttgaaa acttctgcgt 
ttgagacata 

1681 gatggtgttg catgctgcca ccagttactc cggttaaata tggatgtttc 
atgggggaag 

1741 tcagcaattg gccaaagatt cagataggtg gaattggggg gataaggaat 
caaatgcatc 
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1801 tgctaaactg attggagaaa aacacatgca atatcttcag tacactctca 
tttaaaccac 

1861 aagtagatat aaagcctaga gaaatacaga tgtctgctct gttaaatata 
aaatagcaaa 

5 1921 tgttcattca atttgaagac ctagaatttt tcttcttaaa taccaaacac 

gaataccaaa 

1981 ttgcgtaagt accaattgat aagaatatat caccaaaatg taccatcatg 
ctcttccttc 

2041 taccctttga taaactctac catgctcctt ctttgtagct aaaaacccat 
10 caaaatttag 

2101 ggtagagtgg atgggcattg ttttgaggta ggagaaaagt aaacttggga 
ccattctagg 

2161 ttttgttgct gtcactaggt aaagaaacac ctctttaacc acagtctggg 
gacaagcatg 

15 2221 caacatttta aaggttctct gctgtgcatg ggaaaagaaa catgctgaga 

accaatttgc, 

2281 atgaacatgt tcacttgtaa gtagaattca ctgaatggaa ctgtagctct 
agatatctca 

2341 catgggggga agtttaggac cctcttgtct ttttgtctgt gtgcatgtat 
20 ttctttgtaa 

2401 agtactgcta tgtttctctt tgctgtgtgg caacttaagc ctcttcggcc 
tgggataaaa 

2461 taatctgcag tggtattaat aatgtacata aagtcaacat atttgaaagt 
agattaaaat 

25 2521 cttttttaaa tatatcaatg atggcaaaaa ggttaaaggg ggcctaacag 

tactgtgtgt 

2581 agtgttttat ttttaacagt agtacactat aacttaaaat agacttagat 
tagactgttt 

2641 gcatgattat gattctgttt cctttatgca tgaaatattg attttacctt 
30 tccagctact 

2701 tcgttagctfc taattttaaa atacattaap tgagtcttcc ttcttgttcg 
aaaccagctg 

2761 ttcctcctcc cactgacctg cgattcacca acattggtcc agacaccatg 
cgtgtcacct 
35 2821 ggg 

// 
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CEA 

LOCUS NM_0043 63 2974 bp mRNA linear 

PRI 28-NOV-2000 

DEFINITION Homo sapiens carcinoembryonic antigen- related cell 
adhesion 

molecule 5 (CEACAM5) , mRNA. 

ACCESSION NM_004363 

VERSION NM_004363.1 GI: 11386170 

KEYWORDS 

SOURCE human. 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 

Euteleostorai ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; 

Homo. 

REFERENCE 1 (bases 1 to 2974) 

AUTHORS Oikawa S, Nakazato H and Kosaki G. 

TITLE Primary structure of human carcinoembryonic antigen 

(CEA) deduced 

from cDNA sequence 
JOURNAL Biochem. Biophys. Res. Comraun. 142 (2), 511-518 (1987) 
MEDLINE 87128144 
PUBMED 3814146 
REFERENCE 2 (bases 1 to 2974) 

AUTHORS Zimmermann W, Weber B, Ortlieb B, Rudert F, Schempp W, 

Fiebig HH, 

Shively JE, von Kleist S and Thompson JA. 
TITLE Chromosomal localization of the carcinoembryonic 

antigen gene 

family and differential expression in various tumors 
JOURNAL Cancer Res. 48 (9), 2550-2554 (1988) 
MEDLINE 88184584 
PUBMED 3356015 
REFERENCE 3 (bases 1 to 2974) 

AUTHORS Bamett,T., Goebel,S.J.. Nothdurf t ,M.A. and 

Elting, J. J. 

TITLE Carcinoembryonic antigen family: characterization of 

cDNAs coding 



-195- 



wo 2003/008537 



PCT/US2002/010189 



for NCA and CEA and suggestion of nonreu:idom sequence 

variation in 

their conserved loop-domains 
JOURNAL Genomics 3 (1), 59-66 - (1988) 
5 MEDLINE 89122014 

REFERENCE 4 (bases 1 to 2974) 

AUTHORS Barnett T and Zimmermann W. 

TITLE Workshop report: proposed nomenclature for the 

carcinoembiryonic 
10 antigen (CEA) gene family 

JOURNAL Tumoxir Biol. 11 (1-2), 59-63 (1990) 
MEDLINE 90176333 
PUBMED 2309067 
REFERENCE 5 (bases 1 to 2974) 
15 AUTHORS Schrewe H, Thompson J, Bona Hefta LJ, Maruya A, 

Hassauer 

Shively JE, von Kleist S and Zimmermann W. 
TITLE Cloning of the complete gene for carcinoembryonic 

ant i gen : analy s i s 

20 of its promoter indicates a region conveying cell 

type-specific 

expression 

JOURNAL Mol, Cell. Biol, 10 (6), 2738-2748 (1990) 
MEDLINE 90258861 
25 PUBMED 2342461 

COMMENT PROVISIONAL REFSEQ: This record has not yet been 

siibject to final 

NCBI review. The reference sequence was derived from 

M29540.1. 

30 FEATURES Ijocation/Qualif iers 

source 1..2974 

/organism="Homo sapiens" 

/db_xref = " taxon : 96 06 " 

/ chr omosome= "19" 
35 /map="19ql3 . l-ql3 . 2 « 

gene 1..2974 

/gene="CEACAM5" 

/note="CEA; CD66e" 

/db_xref="LocusID:1048" 
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/db_xref="MIM: 114890" 
115- .2223 
/gene="CEACAM5" 

/note="carcinoembryonic antigen" 
/codon__start=l 
/db_xref = "LocusID : 1048 " 
/db_xref="MIM: 114890" 

/product="carcinoenibryonic antigen- related 



/translation="MESPSAPPHRWCIPWQM,U.TASLLTFWNPPTTAKLTIESTPFN 
VAEGKEVLriVHNLPQHLFGYSWYKGERVDGNRQIIGYVIGTQQATPGPAYSGREIIY 
PNASLLIQNIIQNDTGFYTLHVIKSDLVNEEATGQFRVYPELPKPSISSNNSKPVEDK 
DAVAFTCEPETQDATYLWWVimQSLPVSPRLQLSNGNRTLTLFNVTRlJDTASYKCETQ 
NPVSARRSDSVILNVLYGPDAPTISPLNTSYRSGENL^SCHAASNPPAQYSWFVNGT 
FQQSTQELFIPNITVmSGSYTCOAHNSDTGI^TTVTTITVYAEPPKPF 

VEDEDAVALTCEPEIQNTTYLWWVl^QSLPVSPRMLSiroNRTLT^^ 

CGIQNELSVDHSDPVILNVLYGPDDPTISPSYTYYRPGVNLSLSCHAASNPPAQYSWL 

IDGNIQQHTQELFISNITEKNSGLYTCQANNSASGHSRTTVKTITVSAELPKPSISSN 

NSKPVEDKDAVAFTCEPEAQNTTTLWWVNGQSLPVSPRLQLSNGNRTLTLFNVTRiro 

RAYVCGlQNSVSANRSDPWHDVLYGPDTPlISPPDSSYLSGANIiNLSCH 

YSWRINGIPQQHTQVLFIAKITPNNNGTYACFVSNLATGRNNSIVKSITVSASGTSPG 



cell adhesion 



molecule 5" 

/protein_id="NP_004354 . 1" 
/db__xref="GI: 113 86171" 



LSAGATVGIMIGVLVGVALI " 



misc feature 



595.. 780 

/note="IG; Region: Immunoglobulin" 



misc feature 



raise feature 



misc feature 



misc feature 



868. .1026 

/note="IG; Region: Immunoglobulin" 
1129. .1314 

/note="IG; Region: Immunoglobulin" 
1399. .1566 

/note="IG; Region: Immunoglobulin" 
1930. .2091 

/note="IG; Region: Immunoglobulin" 



BASE COUNT 



840 a 



847 c 613 g 674 t 



ORIGIN 
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1 ctcagggcag agggaggaag 
gacaaaacgt 

61 tcctggaact caagctcttc 
gaccatggag 
5 121 tctccctcgg cccctcccca 

gctcacagcc 

181 tcacttctaa ccttctggaa 
atccacgccg 

241 ttcaatgtcg cagaggggaa 
10 ccagcatctt 

301 tttggctaca gctggtacaa 
tataggatat 

361 g t aataggaa c tcaacaagc 
gataatatac 
15 421 cccaatgcat ccctgctgat 

ctacacccta 

481 cacgtcataa agtcagatct 
ggtatacccg 

541 gagctgccca agccctccat 
20 caaggatgct 

601 gtggccttca cctgtgaacc 
ggtaaacaat 

6 61 cagagcctcc cggtcagtcc 
cctcactcta 
25 721 ttcaatgtca caagaaatga 

cccagtgagt 

781 gccaggcgca gtgattcagt 
ccccaccatt 

841 tcccctctaa acacatctta 
30 ccacgcagcc 

901 tctaacccac ctgcacagta 
atccacccaa 

961 gagctcttta tccccaacat 
ccaagcccat 
35 1021 aactcagaca ctggcctcaa 

tgcagagcca 

1081 cccaaaccct tcatcaccag 
tgctgtagcc 



PCT/US2002/010189 

gacagcagac cagacagtca cagcagcctt 
tccacagagg aggacagagc agacagcaga 
cagatggtgc atcccctggc agaggctcct 
cccgcccacc actgccaagc tcactattga 
ggaggtgctt ctacttgtcc acaatctgcc 
aggtgaaaga gtggatggca accgtcaaat 
taccccaggg cccgcataca gtggtcgaga 
ccagaacatc atccagaatg acacaggatt 
tgtgaatgaa gaagcaactg gccagttccg 
ctceagcaac aactccaaac ccgtggagga 
tgagactcag gacgcaacct acctgtggtg 
caggctgcag ctgtccaatg gcaacaggac 
cacagcaagc tacaaatgtg aaacccagaa 
catcctgaat gtcctctatg gcccggatgc 
cagatcaggg gaaaatctga acctctcctg 
ctcttggttt gtcaatggga ctttccagca 
cactgtgaat aatagtggat cctatacgtg 
taggaccaca gtcacgacga tcacagtcta 
caacaactcc aaccccgtgg aggatgagga 
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1141 ttaacctgtg aacctgagat tcagaacaca acctacctgt ggtgggtaaa 
taatcagagc 

1201 ctcccggtca gtcccaggct gcagctgtcc aatgacaaca ggaccctcac 
tctactcagt 

5 1261 gtcacaagga atgatgtagg accctatgag tgtggaatcc agaacgaatt 

aagtgttgac 

1321 cacagcgacc cagtcatcct gaatgtcctc tatggcccag acgaccccac 
catttccccc 

1381 tcatacacct attaccgtcc aggggtgaac ctcagcctct cctgccatgc 
10 agcctctaac 

1441 ccacctgcac agtattcttg gctgattgat gggaacatcc agcaacacac 

acaagagctc 

1501 tttatctcca acatcactga gaagaacagc ggactctata cctgccaggc 
caataactca 

15 1561 gccagtggcc acagcaggac tacagtcaag acaatcacag tctctgcgga 

gctgcccaag 

1621 ccctccatct ccagcaacaa ctccaaaccc gtggaggaca aggatgctgt 
ggccttcacc 

1681 tgtgaacctg aggctcagaa cacaacctac ctgtggtggg taaatggtca 
20 gagcctccca 

1741 gtcagtccca ggctgcagct gtccaatggc aacaggaccc tcactctatt 
caatgtcaca 

1801 agaaatgacg caagagccta tgtatgtgga atccagaact cagtgagtgc 
aaaccgcagt 

25 1861 gacccagtca ccctggatgt cctctatggg ccggacaccc ccatcatttc 

ccccccagac 

1921 tcgtcttacc tttcgggagc gaacctcaac ctctcctgcc actcggcctc 
taacccatcc 

1981 ccgcagtatt cttggcgtat caatgggata ccgcagcaac acacacaagt 
30 tctctttatc 

2041 gccaaaatca cgccaaataa taacgggacc tatgcctgtt ttgtctctaa 
cttggctact 

2101 ggccgcaata attccatagt caagagcatc acagtctctg catctggaac 
ttctcctggt 

35 2161 ctctcagctg gggccactgt cggcatcatg attggagtgc tggttggggt 

tgctctgata 

2221 tagcagccct ggtgtagttt cttcatttca ggaagactga cagttgtttt 
gcttcttcct 
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2281 taaagcattt gcaacagcta cagtctaaaa ttgcttcttt accaaggata 
tttacagaaa 

2341 agactctgac cagagatcga gaccatccta gccaacatcg tgaaacccca 
tctctactaa 

2401 aaatacaaaa atgagctggg cttggtggcg cgcacctgta gtcccagtta 
ctcgggaggc 

2461 tgaggcagga gaatcgcttg aacccgggag gtggagattg cagtgagccc 
agatcgcacc 

2521 actgcactcc agtctggcaa cagagcaaga ctccatctca aaaagaaaag 
aaaagaagac 

2581 tctgacctgt actcttgaat acaagtttct gataccactg cactgtctga 
gaatttccaa 

2641 aactttaatg aactaactga cagcttcatg aaactgtcca ccaagatcaa 
gcagagaaaa 

2701 taattaattt catgggacta aatgaactaa tgaggattgc tgattcttta 
aatgtcttgt 

2761 ttcccagatt tcaggaaact ttttttcttt taagctatcc actcttacag 
caatttgata 

2821 aaatatactt ttgtgaacaa aaattgagac atttacattt tctccctatg 
tggtcgctcc 

2881 agacttggga aactattcat gaatatttat attgtatggt aatatagtta 
ttgcacaagt 

2941 tcaataaaaa tctgctcttt gtataacaga aaaa 

// 



Her2/Neu 

LOCUS HX3MHER2A 4530 bp mRNA linear 

PRI 18-SEP-1995 

DEFINITION Human tyrosine kinase- type receptor (HER2) mRNA, 

complete cds . 

ACCESSION M11730 

VERSION M11730.1 GI: 183 986 

KEYWORDS tyrosine kinase. 

SOURCE Homo sapiens {clone: lambda-HER2-436) fetal cDNA to 

mRNA. 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Crania ta; Vertebrata; 

Euteleostorai; 
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Mammalia; Eutheria; Primates; Catarrhini; Hominidae; 

Homo- 

REFERENCE 1 (bases 1 to 4530) 

AUTHORS Coussens.L., Yang-Feng ^ T. L. , Ijiao,Y.-C., Chen,E,, 

Gray, A. , 

McGra th , J . , Seeburg , P . H , , Libermann , T . A . , 

Schlessinger, J. , 

Francke , U . / Levinson, A . and Ullr ich , A . 
TITLE Tyrosine kinase receptor with extensive homology to 

EGF receptor 

shares chromosomal location with neu oncogene 
JOURNAL Science 230 (4730), 1132-1139 (1985) 
MEDLINE 86070181 
REFERENCE 2 (bases 1701 to 1719) 
AUTHORS Ullrich, A. 
JOURNAL Unpublished (1988) 
FEATURES Location/Qualif iers 

source 1,.4530 

/organism="Homo sapiens" 
/db_xref = " taxon : 96 06 " 
/clone="lambda-HER2-436" 
/dev_s tage= " fetal " 
mRlSIA <1.,4530 

/product ="HER2 mRNA" 
CDS 151., 3918 

/note="HER2 receptor" 
/ codon_s t ar t = 1 
/protein_id="AAA75493 .1" 
/db_xref="GI:306840" 

/translation=«MELAALCRWGLLIiALLPPGAASTQVCTGTDMKLRLPASPETHLD 
MLRHLYQGCQWQGNLELTYLPTNASLSFLQDIQEVQGYVLIAHNQVRQVPLQRLRIV 
RGTQLFEDNYALAVLDNGDPLNNTTPVTGASPGGLRELQLRSLTEILKGGVLIQRNPQ 
LCYQDTILWKDIFHKNNQLALTLIDTNRSRACHPCSPMCKGSRCWGESSEDCQSLTRT 
VCAGGCARCKGPLPTDCCHEQCAAGCTGPKHSDCLACLHFNHSGICELHCPALVTYNT 
DTFESMPNPEGRYTFGASCVTACPYNYLSTDVGSCTLVCPLHNQEVTAEDGTQRCEKC 
SKPCARVCYGLGMEHLRBVRAVTSANIQEFAGCKKIFGSLAFLPESFDGDPASNTAPL 
QPEQLQVFETLEEITGYLYISAWPDSLPDLSVFQNLQVIRGRILHNGAYSLTLQGLGI 
SWLGLRSLRELGSGLALIHHNTHLCFVHTVPWDQLFRNPHQALLHTANRPEDECVGEG 
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lACHQLCARGHCWGPGPTQCWCSQFLRGQECVEECRVLQGLPREYVNARHCLPC^ 

CQPQNGSVTCFGPBADQCn/ACAHyKDPPFCVi^CPSGVKPDI^ 

PCPINCrrHSCM)LDDKGCPAEQRASPLTSIVSAWGILLVVVIiGVVFGILIK^ 

rkytmrrllqetelvepltpsgampnqaqmrilketelrkvkvlgsg;^^ 
5 pdgenvkipvaikvlrkotspkankeiijdeayvmagvgspyvsr^^ 

qlmpygclldhvrenrgrlgsqdllnwcmqiakgmsyledvrlvhrdlaar 

NHVKITDFGLARIiLDIDETEYHADGGKVPIKWMALESILRRRFTHQSDWSYG^^ 

LMTFGAKPYDGIPAREIPDLLEKGERLPQPPICTIDVYMIMVKOflM 

VSEFSRMARDPQRFWIQNEDLGPASPLDSTFYRSLLEDDDMGDLVDAEEYLVPQQGF 

10 FCPDPAPGAGCMVHHRHRSSSTRSGGGDLTLGLEPSEEEAPRSPLAPSEGAGSDVFDG 
DLO^GAAKGLQSLPTHDPSPLQRYSEDPTVPLPSETDGYVAPLTCSPQPEYVNQPDVR 
PQPPSPREGPLPA?UlPAGATIiERAKTLSPGKNGVVKDVFAFGGAVENPEYLTPQGGAA 
PQPHPPPAFSPAFDNLYYWDQDPPERGAPPSTFKGTPTAENPEYLGLDVPV" 
old^sequence 1701 . . 1719 

15 /citation=[l] 

BASE COUNT 922 a 1382 c 1346 g 880 t 

ORIGIN Chromosome 17q21--q22. 

1 aattctcgag ctcgtcgacc ggtcgacgag ctcgagggtc gacgagctcg 
agggcgcgcg 

20 61 cccggccccc acccctcgca gcaccccgcg ccccgcgccc tcccagccgg 

gtccagccgg 

121 agccatgggg ccggagccgc agtgagcacc atggagctgg cggccttgtg 
ccgctggggg 

181 ctcctcctcg ccctcttgcc ccccggagcc gcgagcaccc aagtgtgcac 
25 cggcacagac 

241 atgaagctgc ggctccctgc cagtcccgag acccacctgg acatgctccg 
ccacctctac 

3 01 cagggctgcc aggtggtgca gggaaacctg gaactcacct acctgcccac 
caatgccagc 

30 361 ctgtccttcc tgcaggatat ccaggaggtg cagggctacg tgctcatcgc 

tcacaaccaa 

421 gtgaggcagg tcccactgca gaggctgcgg attgtgcgag gcacccagct 
ctttgaggac 

481 aactatgccc tggccgtgct agacaatgga gacccgctga acaataccac 
35 ccctgtcaca 

541 ggggcctccc caggaggcct gcgggagctg cagcttcgaa gcctcacaga 
gatcttgaaa 

601 ggaggggtct tgatccagcg gaacccccag ctctgctacc aggacacgat 
tttgtggaag 
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661 gacatcttcc acaagaacaa 
ccgctctcgg 

721 gcctgccacc cctgttctcc 
gagttctgag 
5 781 gattgtcaga gcctgacgcg 

caaggggcca 

841 ctgcccactg actgctgcca 
caagcactct 

901 gactgcctgg cctgcctcca 
10 ctgcccagcc 

961 ctggtcacct acaacacaga 
ccggtataca 

1021 ttcggcgcca gctgtgtgac 
cgtgggatcc 
15 1081 tgcaccctcg tctgccccct 

aacacagcgg 

1141 tgtgagaagt gcagcaagcc 
ggagcacttg 

1201 cgagaggtga gggcagttac 
20 caagaagatc 

1261 tttgggagcG tggcatttct 
caacactgcc 

1321 ccgctccagc cagagcagct 
aggttaccta 
25 1381 tacatctcag catggccgga 

cctgcaagta 

1441 atccggggac gaattctgca 
gctgggcatc 

1501 agctggctgg ggctgcgctc 
30 catccaccat 

1561 aacacccacc tctgcttcgt 
gaacccgcac 

1621 caagctctgc tccacactgc 
gggcctggcc 
35 1681 tgccaccagc tgtgcgcccg 

gtgtgtcaac 

1741 tgcagccagt tccttcgggg 
gcaggggctc 



ccagctggct ctcacactga tagacaccaa 

gatgtgtaag ggctcccgct gctggggaga 

cactgtctgt gccggtggct gtgcccgctg 

tgagcagtgt gctgccggct gcacgggccc 

cttcaaccac agtggcatct gtgagctgca 

cacgtttgag tccatgccca atcccgaggg 

tgcctgtccc tacaactacc tttctacgga 

gcacaaccaa gaggtgacag cagaggatgg 

ctgtgcccga gtgtgctatg gtctgggcat 

cagtgccaat atccaggagt ttgctggctg 

gccggagagc tttgatgggg acccagcctc 

ccaagtgttt gagactctgg aagagatcac 

cagcctgcct gacctcagcg tcttccagaa 

caatggcgcc tactcgctga ccctgcaagg 

actgagggaa ctgggcagtg gactggccct 

gcacacggtg ccctgggacc agctctttcg 

caaccggcca gaggacgagt gtgtgggcga 

agggcactgc tggggtccag ggcccaccca 

ccaggagtgc gtggaggaat gccgagtact 
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1801 cccagggagt atgtgaatgc 
tcagccccag 

1861 aatggctcag tgacctgttt 
tgcccactat 
5 1921 aaggaccctc ccttctgcgt 

cctctcctac 

1981 atgcccatct ggaagtttcc 
catcaactgc 

2041 acccactcct gtgtggacct 
10 agccagccct 

2101 ctgacgtcca tcgtctctgc 

2161 tttgggatcc tcatcaagcg 
gcggagactg 
15 2221 ctgcaggaaa cggagctggt 

caaccaggcg 

2281 cagatgcgga tcctgaaaga 
atctggcgct 

2341 tttggcacag tctacaaggg 
20 aattccagtg 

2401 gccatcaaag tgttgaggga 
cttagacgaa 

2461 gcatacgtga tggctggtgt 
catctgcctg 
25 2521 acatccacgg tgcagctggt 

agaccatgtc 

2581 cgggaaaacc gcggacgcct 
gcagattgcc 

2641 aaggggatga gctacctgga 
30 cgctcggaac 

2701 gtgctggtca agagtcccaa 
tcggctgctg 

2761 gacattgacg agacagagta 
gtggatggcg 
35 2821 ctggagtcca ttctccgccg 

ttatggtgtg 

2881 ac t g tgtggg age tgatgac 
agcccgggag 



caggcactgt ttgccgtgcc accctgagtg 

tggaccggag gctgaccagt gtgtggcctg 

ggcccgctgc cccagcggtg tgaaacctga 

agatgaggag ggcgcatgcc agccttgccc 

ggatgacaag ggctgccccg ccgagcagag 

ggtggttggc attctgctgg tcgtggtctt 

acggcagcag aagatccgga agtacacgat 

ggagccgctg acacctagcg gagcgatgcc 

gacggagctg aggaaggtga aggtgcttgg 

catctggatc cctgatgggg agaatgtgaa 

aaacacatcc cccaaagcca acaaagaaat 

gggctcccca tatgtctccc gccttctggg 

gacacagctt atgccctatg gctgcctctt 

gggctcccag gacctgctga actggtgtat 

ggcitgtgcgg ctcgtacaca gggacttggc 

ccatgtcaaa attacagact tcgggctggc 

ccatgcagat gggggcaagg tgcccatcaa 

gcggttcacc caccagagtg atgtgtggag 

ttttggggcc aaaccttacg atgggatccc 
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2941 atccctgacc tgctggaaaa 
caccattgat 

3001 gtctacatga tcatggtcaa 
aagattccgg 
5 3061 gagttggtgt ctgaattctc 

ggtcatccag 

3121 aatgaggact tgggcccagc 
actgctggag 

3181 gacgatgaca tgggggacct 
10 gcagggcttc 

3241 ttctgtccag accctgcccc 
ccgcagctca 

3301 tctaccagga gtggcggtgg 
agaggaggcc 
15 3361 cccaggtctc cactggcacc 

tggtgacctg 

3421 ggaatggggg cagccaaggg 
ccctctacag 

3481 cggtacagtg aggaccccac 
20 cgttgccccc 

3541 ctgacctgca gcccccagcc 
ccagccccct 

3601 tcgccccgag agggccctct 
ggaaagggcc 
25 3661 aagactctct ccccagggaa 

tgggggtgcc 

3721 gtggagaacc ccgagtactt 
ccaccctcct 

3781 cctgccttca gcccagcctt 
30 accagagcgg 

3841 ggggctccac ccagcacctt 
gtacctgggt 

3901 ctggacgtgc cagtgtgaac 
tgtcctcagg 

35 3961 gagcagggaa ggcctgactt 

cgaccacttc 

4021 caggggaacc tgccatgcca 
cttgagttcc 



gggggagcgg ctgccccagc cccccatctg 

atgttggatg attgactctg aatgtcggcc 

ccgcatggcc agggaccccc agcgctttgt 

cagtcccttg gacagcacct tctaccgctc 

ggtggatgct gaggagtatc tggtacccca 

gggcgctggg ggcatggtcc accacaggca 

ggacctgaca ctagggctgg agccctctga 

ctccgaaggg gctggctccg atgtatttga 

gctgcaaagc ctccccacac atgaccccag 

agtacccctg ccctctgaga ctgatggcta 

tgaatatgtg aaccagccag atgttcggcc 

gcctgctgcc cgacctgctg gtgccactct 

gaatggggtc gtcaaagacg tttttgcctt 

gacaccccag ggaggagctg cccctcagcc 

cgacaacctc tattactggg accaggaccc 

caaagggaca cctacggcag agaacccaga 

cagaaggcca agtccgcaga agccctgatg 
ctgctggcat caagaggtgg gagggccctc 
ggaacctgtc ctaaggaacc ttccttcctg 
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4081 cagatggctg gaaggggtcc agcctcgttg gaagaggaac agcactgggg 
agtctttgtg 

4141 gattctgagg ccctgcccaa tgagactcta gggtccagtg gatgccacag 
cccagcttgg 

5 4201 ccctttcctt ccagatcctg ggtactgaaa gccttaggga agctggcctg 

agaggggaag 

4261 cggccctaag ggagtgtcta agaacaaaag cgacccattc agagactgtc 
cctgaaacct 

4321 agtactgccc cccatgagga aggaacagca atggtgtcag tatccaggct 

10 ttgtacagag 

4381 tgcttttctg tttagttttt actttttttg ttttgttttt ttaaagacga 

aataaagacc 

4441 caggggagaa tgggtgttgt atggggaggc aagtgtgggg ggtccttctc 
cacacccact 

15 4501 ttgtccattt gcaaatatat tttggaaaac 

// 



SCP-1 

20 LOCUS HSSCPIPRT 3393 bp mRNA linear 

PRI 26-FEB-1997 

DEFINITION H. sapiens mRNA for SCPl protein. 
ACCESSION X95654 
VERSION X95654.1 01:1212982 

25 KEYWORDS SCPl gene. 

SOURCE human . 

ORGANISM Homo sapiens 

Eiokaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi ; 

30 Mammalia; Eutheria; Primates; Catarrhini; Hominidae; 

Homo. 

REFERENCE 1 (bases 1 to 3393) 

AUTHORS Meuwissen,R.L- , Meerts,!., Hoovers , J . M , , Leschot,N.J. 
and 

35 Heyting.C. 

TITLE Human synaptonemal complex protein 1 (SCPl) : isolation 

and 

characterization of the cDNA and chromosomal 
localization of the 
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gene 

JOURNAL Genomics 39 (3) , 377-384 (1997) 
MEDLINE 97224467 
REFERENCE 2 (bases 1 to 3393) 
5 AUTHORS Meuwissen.R. J-L. 

TITLE Direct Siibmission 

JOURNAL Submitted (13-FEB-1996) Dr. R.L.J. Meuwissen, 

Agricultural 

University, Genetics, Dreijenlaan 2, 6703 HA 

10 WAGENIN6EN, 

NETHERLANDS 
FEATURES Location/Qualifiers 
source 1, .3393 

/organi sm= " Homo sapiens " 
15 /db_xref = " taxon : 96 06 " 

/map="lpl2"pl3" 
/tis3ue_type= "testis" 
gene 95 ..3 025 

/gene="SCPl" 
20 CDS 95.. 3025 

/gene="SCPl" 
/codon_start=l 
/evidence=experiraental 
/products "polypeptide of 976 aa" 
25 /protein_id="CAA64956 .1" 

/db_xref="GI:1212983" 
/db^xref = " SWISS -PROT : Q154 31 " 

/ trans 1 a t ion= " MEKQKPFALFVPPRSSSSQVSAVKPQTLGGDSTFFKSFNKCTED 
30 DLEFPFAKTNLSKNGENIDSDPALQKVNFLPVLEQVGNSDCHYQEGLKDSDLENSEGL 
SRVFSKLYKEAEKIKKWKVSTEAELRQKESKLQENRKIIEAQRKAIQELQFGNEKVSL 
KLEEGIQENKDLIKENNATRHLCNLLKETCARSAEKTKKYEYEREETRQVYMDLNNNI 
EKMITAHGELRVQAENSRLEMHFKLKEDYEKIQHLEQEYKKEINDKEKQVSLLLIQIT 
EKENKMKDLTFLLEESRDKVNQLEEKTKLQSENLKQSIEKQHHLTKELEDIKVSLQRS 
35 VSTQKALEEDLQIATKTICQLTEEKETQMEESNKARAAHSPWTEPETTVCSLEELLR 
TEQQRLEKNEDQLKILTMELQKKSSELEEMTKLTNNKEVELEELKKVLGEKETLLY^ 
KQFEKIAEELKGTEQELIGLLQAREKEVHDLEIQLTAITTSEQYYSKEVKDLKTELEN 
EKLKNTELTSHCNKLSLENKELTQETSDMTLBLKNQQEDINNNKKQEERMLKQIENLQ 
ETETQLRNELEYVREELKQKRDEVKCKLDKSEENCNNLRKQVENKNKYIEELQQENKA' 
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LKKKGTAESKQUJVYEIKVNKLELELESAKQKFGEITDTYQKEIEDKKISEENLLEBV 

EKAKVI^U3EAVKLQKEIDKRCQHKIAE^WM-MBKHKHQYDKIIEBRDSELGLyKSKEQ 

EQSSLRMLBIBLSNLKAEIiSVKKQLEIBREEKEKLKREAKEOTATLKEKKDKKTQ 

FLLETPEIYVnCU)SKAVPSOrVSimPTSVDHGISKDKRDYLWTSAKOTLSTPLPKAYT 

VKTPTKPKLQQRENLNIPIEESKKKRKMAFBFDINSDSSETTDLLSMVSEEETLKTLY 

RNiraPPASHLCVKTPKKAPSSLTTPGPTLKPCJMRKMREDRWAVIi^RKKKLKEAE 

KLFV" 

BASE COUNT 1440 a 483 C 648 g 822 t 

ORIGIN 

1 gccctcatag accgtttgtt gtagttcgcg tgggaacagc aacccacggt 
ttcccgatag 

61 ttcttcaaag atatttacaa ccgtaacaga gaaaatggaa aagcaaaagc 
cctttgcatt 

121 gttcgtacca ccgagatcaa gcagcagtca ggtgtctgcg gtgaaacctc 
agaccctggg 

181 aggcgattcc actttcttca agagtttcaa caaatgtact gaagatgatt 
tggagtttcc 

241 atttgcaaag actaatctct ccaaaaatgg ggaaaacatt gattcagatc 
ctgctttaca 

301 aaaagttaat ttcttgcccg tgcttgagca ggttggtaat tctgactgtc 
actatcagga 

361 aggactaaaa gactctgatt tggagaattc agagggattg agcagagtgt 
tttcaaaact 

421 gtataaggag gctgaaaaga taaaaaaatg gaaagtaagt acagaagctg 
aactgagaca 

481 gaaagaaagt aagttgcaag aaaacagaaa gataattgaa gcacagcgaa 
aagccattca 

541 ggaactgcaa tttggaaatg aaaaagtaag tttgaaatta gaagaaggaa 
tacaagaaaa 

601 taaagattta ataaaagaga ataatgccac aaggcattta tgtaatctac 
tcaaagaaac 

661 ctgtgctaga tctgcagaaa agacaaagaa atatgaatat gaacgggaag 
aaaccaggca 

721 agtttatatg gatctaaata ataacattga gaaaatgata acagctcatg 
gggaacttcg 

781 tgtgcaagct gagaattcca gactggaaat gcattttaag ttaaaggaag 
attatgaaaa 

841 aatccaacac cttgaacaag aatacaagaa ggaaataaat gacaaggaaa 
agcaggtatc 
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901 actactattg atccaaatca 
catttctgct 

961 agaggaatcc agagataaag 
agagtgaaaa 
5 1021 cttaaaacaa tcaattgaga 

atattaaagt 

1081 gtcattacaa agaagtgtga 
agatagcaac 

1141 aaaaacaatt tgtcagctaa 
10 ctaataaagc 

1201 tagagctgct cattcgtttg 
gcttggaaga 

1261 attattgaga acagaacagc 
aaatacttac 
15 1321 catggagctt caaaagaaat 

caaataacaa 

1381 agaagtagaa cttgaagaat 
ttttatatga 

1441 aaataaacaa tttgagaaga 
20 aactaattgg 

1501 tcttctccaa gccagagaga 
ctgccattac 

1561 cacaagtgaa cagtattatt 
ttgaaaacga 
25 1621 gaagcttaag aatactgaat 

aaaacaaaga 

1681 gctcacacag gaaacaagtg 
aagatattaa 

1741 taataacaaa aagcaagaag 
30 aagaaacaga 

1801 aacccaatta agaaatgaac 
aaagagatga 

1861 agttaaatgt aaattggaca 
aacaagttga 
35 1921 aaataaaaac aagtatattg 

aaaaaaaagg 

1981 tacagcagaa agcaagcaac 
tagagttaga 



ctgagaaaga aaataaaatg aaagatttaa 
ttaatcaatt agaggaaaag acaaaattac 
aacagcatca tttgactaaa gaactagaag 
gtactcaaaa ggctttagag gaagatttac 
ctgaagaaaa agaaactcaa atggaagaat 
tggttactga atttgaaact actgtctgca 
aaagattgga aaaaaatgaa gatcaattga 
caagtgagct ggaagagatg actaagctta 
tgaaaaaagt cttgggagaa aaggaaacac 
ttgctgaaga attaaaagga acagaacaag 
aagaagtaca tgatttggaa atacagttaa 
caaaagaggt taaagatcta aaaactgagc 
taacttcaca ctgcaacaag ctttcactag 
atatgaccct agaactcaag aatcagcaag 
aaaggatgtt gaaacaaata gaaaatcttc 
tagaatatgt gagagaagag ctaaaacaga 
agagtgaaga aaattgtaac aatttaagga 
aagaacttca gcaggagaat aaggccttga 
tgaatgttta tgagataaag gtcaataaat 
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2041 actagaaagt gccaaacaga 
aagaaattga 

2101 ggacaaaaag atatcagaag 
aagtaatagc 
5 2161 tgatgaagca gtaaaattac 

aaatagctga 

2221 aatggtagca cttatggaaa 
aagaaagaga 

2281 ctcagaatta ggactttata 
10 gagcatcttt 

2341 ggagattgaa ctatccaatc 
aacttgaaat 

2401 agaaagagaa gagaaggaaa 
ctactcttaa 
15 2461 agaaaaaaaa gacaagaaaa 

tttattggaa 

2521 attggattct aaagcagttc 
cagttgatca 

2581 tggcatatcc aaagataaaa 
20 ctttatctac 

2641 accattgcca aaggcatata 
agcaaagaga 

2701 aaacttgaat atacccattg 
ttgaatttga 

25 2761 tattaattca gatagttcag 

aagaagagac 

2821 attgaaaaca ctgtatagga 
tcaaaacacc 

2881 aaaaaaggcc ccttcatctc 
30 gagctataag 

2941 aaaaatgcgg gaggaccgtt 
aaaaactaaa 

3001 agaagctgaa aagttatttg 
agcctaataa 
35 3061 cgtgaaactt atagttaata 

atctggaagt 

3121 tgagacttaa aaaatacttg 
agcctaaatg 



aatttggaga aatcacagac acctatcaga 

aaaatctttt ggaagaggtt gagaaagcaa 

agaaagaaat tgataagcga tgtcaacata 

aacataagca ccaatatgat aagatcattg 

agagcaaaga acaagaacag tcatcactga 

tcaaagctga acttttgtct gttaagaagc 

aactcaaaag agaggcaaaa gaaaacacag 

cacaaacatt tttattggaa acacctgaaa 

cttcacaaac tgtatctcga aatttcacat 

gagactatct gtggacatct gccaaaaata 

cagtgaagac accaacaaaa ccaaaactac 

aagaaagtaa aaaaaagaga aaaatggcct 

aaactactga tcttttgagc atggtttcag 

acaataatcc accagcttct catctttgtg 

taacaacccc tggacctaca ctgaagtttg 

gggctgtaat tgctaaaatg gatagaaaaa 

tttaatttca gagaatcagt gtagttaagg 

ttttgttctt atttgccaga gccacatttt 

catgaatgat ttgtgtttct ttatattttt 
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3181 ttaactacat attgtctgga aacctgtcat tgtattcaga taattagatg 
attatatatt 

3241 gttgttactt tttcttgtat tcatgaaaac tgtttttact aagttttcaa 
atttgtaaag 

5 3301 ttagcctttg aatgctagga atgcattatt gagggtcatt ctttattctt 

tactattaaa 

3361 atattttgga tgcaaaaaaa aaaaaaaaaa aaa 

// 

10 

SSX-4 

LOCUS NM_005636 576 bp niRNA linear 

PRI lO-DEC-2001 

DEFINITION Homo sapiens synovial sarcoma, X brealcpoint 4 {SSX4) , 
15 mRNA. 

ACCESSION NM_005636 

VERSION Niyi_005636,l 01:5032122 

KEYWORDS 

SOURCE human . 

20 ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; 

Homo. 

25 REFERENCE 1 (bases 1 to 576) 

AUTHORS Gure^A.'O., Tureci^O., Sahin,U., Tsang,S., 

Scanlan,M. J. , Knuth.A, , 

Pf reiindschuh.M. , Old, L.J. and Chen,Y.T. 
TITLE SSX: a raultigene family with several members 

30 transcribed in normal 

testis and human cancer 
JOURNAL Int. J. Cancer 72 (6), 965-971 (1997) 
MEDLINE 98021352 
COMMENT PROVISIONAL REFSEQ: This record has not yet been 

35 subject to final 

NCBI review. The reference sequence was derived from 

U90841.1. 

FEATURES Location/Qualifiers 
source 1. .576 
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/orgaiiisin="Homo sapiens" 
/db_xref = t axon : 96 0 6 " 
/ chromosome =" X " 
/inap="Xpll.3" 
1. .576 

/gene="SSX4" 
/db_xr e f = " Locus ID : 67 5 9 " 
/db_xref="MIM: 300326" 
1. .567 

/gene="SSX4" 

/note="Kruppel -associated box containing SSX 

/codon_start=l 
/db_xref ="IjOCUsID: 6759" 
/dh_xref = "MIM : 3 00326 " 

/product=" synovial sarcoma/ X breakpoint 4" 
/pr o t e in_id= " NP_0 05627.1" 
/db_xref ="GI : 5032123 " 

20 /translation="MNGDDAFARRPRDDAQISEKLRKAFDDIAKyPSKKEWEKMKSSEKIVY 

VYMKLNYEVMTKLGFKVTLPPFMRSKRAADFHGNDFGNDRN^^ 

SLQRIFPKIMPKKPAEEENGLKEVPEASGPQNDGKQLCPPGNPSTLEKINKTSGPKRG 

KHAWTHRLRERKQLWYEEISDPEEDDE " 
misc_f eatiire 70 . . 246 
25 /note="KRAB; Region: krueppel associated box" 

BASE COUNT 187 a 127 c 150 g 112 t 

ORIGIN 

1 atgaacggag acgacgcctt tgcaaggaga cccagggatg atgctcaaat 
atcagagaag 

30 61 ttacgaaagg ccttcgatga tattgccaaa tacttctcta agaaagagtg 

ggaaaagatg 

121 aaatcctcgg agaaaatcgt ctatgtgtat atgaagctaa actatgaggt 
catgactaaa 

181 ctaggtttca aggtcaccct cccacctttc atgcgtagta aacgggctgc 
35 agacttccac 

241 gggaatgatt ttggtaacga tcgaaaccac aggaatcagg ttgaacgtcc 
t-cagatgact 

301 ttcggcagcc tccagagaat cttcccgaag atcatgccca agaagccagc 
agaggaagaa 
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361, aatggtttga aggaagtgcc agaggcatct ggcccacaaa atgatgggaa 
acagctgtgc 

421 cccccgggaa atccaagtac cttggagaag attaacaaga catctggacc 
caaaaggggg 

5 481 aaacatgcct ggacccacag actgcgtgag agaaagcagc tggtggttta 

tgaagagatc 

541 agcgaccctg aggaagatga cgagtaactc ccctcg 

All patents and publications mentioned in the specification 

10 are indicative of the levels of those skilled in the art to v/hich 

the invention pertains. All patents and publications are herein 
incorporated by reference to the same extent as if each individual 
publication was specifically and individually indicated to be 
incorporated by reference. 

15 The invention illustratively described herein suitably may be 

practiced in the absence of any element or elements, limitation or 
limitations which is not specifically disclosed herein. The terms 
and expressions which have been ett^loyed are used as terms of 
description and not of limitation, and there is no intention that 

20 in the use of such terms and expressions indicates the exclusion 

of equivalents of the features shown and described or portions 
thereof. It is recognized that various modifications are possible 
within the scope of the invention claimed. Thus, it should be 
understood that although the present invention has been 

25 specifically disclosed by preferred embodiments and optional 

features, modification and variation of the concepts herein 
disclosed may be resorted to by those skilled in the art, and that 
such modifications and variations are considered to be within the 
scope of this invention as defined by the appended claims. 

30 
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WHAT IS CLAIMED IS : 

1. An isolated epitope, comprising a con^wnent selected from the group consisting 

of: 

(i) a polypeptide having the sequence as disclosed in TABLE 1 ; 
5 (ii) an epitope cluster comprising the polypeptide of (i); 

(iii) a polypeptide having substantial similarity to (i) or (ii); 

(iv) a polypeptide having functional similarity to any of (i) through (iii); and 

(v) a nucleic acid encoding the polypeptide of any of (i) through (iv). 

2. The epitope of claim 1 , wherein the epitope is immunologically active. 

10 3. The epitope of claim 1, wherein the polypeptide is less than about 30 amino acids 

in length. 

4. The epitope of claim 1, wherein the polypeptide is 8 to 10 ammo acids in length. 

5. The epitope of claim 1, wherein the substantial or fimctional similarity comprises 
addition of at least one amino acid. 

15 6. The epitope of claim 5, wherein the at least one additional amino acid is at an N- 

terminus of the polypeptide. 

7. The epitope of claim 1 , wherein the substantial or functional similarity comprises a 
substitution of at least one amino acid. 

8. The epitope of claim 1, the polypeptide having affinity to an HLA-A2 molecule. 
20 9. The epitope of claim 8, wherein the affinity is determined by an assay of binding. 

10. The epitope of claim 8, wherein the affinity is determined by an assay of restriction 
of epitope recognition. 

11. The epitope of claim 8, wherein the affinity is determined by a prediction 
algorithm. 

25 12. The epitope of claim 1, the polypeptide having affinity to an HLA-B7 or HLA-B5 1 

molecule. 

1 3 . The epitope of claim 1, wherein the polypeptide is a housekeeping epitope. 

14. The epitope of claim 1, wherein the polypeptide corresponds to an epitope 
displayed on a tumor cell. 

30 15. The epitope of claim 1, wherein the polypeptide corresponds to an epitope 

displayed on a neovasculature cell. 

16. The epitope of claim 1, wherein tiie peptide is an immune epitope. 

1 7. The epitope of claim 1 wherein the epitope is a nucleic acid. 

18. A pharmaceutical composition comprising the peptide of claim 1 and a 
35 pharmaceutically acceptable adjuvant, carrier, diluent, or excipient. 

19* The composition of claim 18, where the adjuvant is a polynucleotide, 
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20. The conqwsition of claim 19 wherein the polynucleotide comprises a dinucleotide. 

21 . The composition of claim 20 wherein the dinucleotide is CpG. 

22. The composition of claim 18, wherein the adjuvant is encoded by a polynucleotide. 

23. The composition of claim 18 wherein the adjuvant is a cytokine. 
5 24. The composition of claim 23 wherein the cytokine is GM-CSF. 

25. The composition of claim 18 further comprising a professional antigen-presenting 
ccll(pAPC). 

26. The composition of claim 25, wherein the pAPC is a dendritic cell. 

27. The con^)osition of claim 1 8, further comprising a second epitope. 

1 0 28. The composition of claim 27, wherein the second epitope is a polypeptide. 

29. The composition of claim 27, wherein the second epitope is a nucleic acid 

30. The composition of claim 27, wherein the second epitope is a housekeeping 

epitope. 

3 1 . The con^osition of claim 27, wherein the second epitope is an immune epitope. 

15 32. A pharmaceutical composition comprising the nucleic acid of claim 1 and a 

pharmaceutically acceptable adjuvant, carrier, diluent, or excipient. 

33. A recombinant construct comprising the nucleic acid of Claim 1 . 

34. The construct of claim 33, further comprising a plasmid, a viral vector, or an 
artificial chromosome. 

20 35. The construct of claim 33, further comprising a sequence encoding at least one 

feature selected from the group consisting of a second epitope, an IRES, an ISS, an NIS, and 
ubiquitin. 

36. A purified antibody that specifically binds to the epitope of claim 1. 

37. A purified antibody that specifically binds to a peptide-MHC protein complex 
25 comprising the epitope of claim 1, 

38. The antibody of claim 36 or claim 37, wherein the antibody is a monoclonal 
antibody. 

39. A multimeric MHC-peptide complex comprising the epitope of claim 1. 

40. An isolated T cell expressing a T cell receptor specific for an MHC-peptide 
30 complex, the complex comprising the epitope of claim 1 . 

41 . The T cell of claim 40, produced by an in vitro immuni2ation. 

42. The T cell of claim 40, isolated from an immunized animal. 

43 . AT cell clone comprising the T cell of claim 40. 

44. A polyclonal population of T cells comprising the T cell of claim 40. 

35 45. A pharmaceutical conposition comprising the T cell of claim 40 and a 

pharmaceutically acceptable adjuvant, carrier, diluent, or excipient. 
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46. An isolated protein molecule comprising tiie binding domain of a T cell receptor 
specific for an MHC-peptide complex, the complex comprising tiie epitope of claim 1. 

47. The protein of claim 46, wherein the protein is multivalent. 

48. An isolated nucleic acid encoding the protein of claim 46. 

49. A recombinant construct comprising the nucleic acid of claim 48. 

50. A host cell expressing the recombinant construct of claim 33 or 49. 

51. The host ceD of claim 50, wherein the host cell is a dendritic cell, macrophage, 
tumor cell, or tumor-derived cell. 

52. The host cell of claim 50, wherein the host cell is a bacterium, fungus, or 
protozoan. 

53. A pharmaceutical composition comprising the host cell of claim 50 and a 
pharmaceutically acceptable adjuvant, carrier, diluent, or excipient. 

54. A vaccine or imnumotherapeutic con^osition conpising at least one component 
selected from the group consisting of the epitope of claim 1; the composition of claim 18, 32, 45, or 
53; the construct of claim 33; the T cell of claim 40, and the host cell of claim 50. 

55. A method of treating an animal, comprising: 

administering to an animal the vaccine or immunotherapeutic composition of claim 

54. 

56. The method of claim 55, wherein the administering step comprises a mode of 
delivery selected from the group consisting of transdermal, intranodal, perinodal, oral, intravenous, 
intradermal, intramuscular, intraperitoneal, mucosal, aerosol inhalation, and instillation. 

57. The method of claim 55, further comprising a step of assaying to determine a 
characteristic indicative of a stale of a target cell or target cells. 

58. The method of claim 57, comprising a first assaying step and a second assaying 
step, wherein the first assaying step precedes the administering step, and wherein the second 
assaying step follows flie administering step. 

59. The method of claim 58, further comprising a step of comparing the characteristic 
determined in the first assaying step with the characteristic determined in the second assaying step 
to obtain a result. 

60. The method of claim 59, wherein the result is selected from the group consisting 
of: evidence of an immune response, a diminution in number of target cells, a loss of mass or size 
of a tumor comprising target cells, a decrease in nimiber or concentration of an intracellular 
parasite infecting target cells. 

61. A method of evaluating immunogenicity of a vaccine or immunotherapeutic 
composition, comprising: 
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administering to an animal the vaccine or iramunotiierapeutic composition of claim 

54; and 

evaluating immunogenicity based on a characteristic of the animal. 
62. The method of claim 61 , wherein the animal is HLA-transgenic. 
5 63. A method of evaluating immunogenicity , comprising: 

in vitro stimulation of a T cell with the vaccine or immunotherapeutic composition 
ofclaim54;and 

evaluating immunogenicity based on a characteristic of the T cell. 
64. The method of claim 63, wherein the stimulation is a primary stimulation. 
10 65 . A method of making a passive/adoptive immunotherapeutic, comprising: 

combining the T cell of claim 40 or the host cell of claim 50 with a 
pharmaceutically acceptable adjuvant, carrier, diluent, or excipient. 
66. A method of determining specific T cell frequency comprising the step of 
contacting T cells with a MHC-peptide con^lex comprising the epitope of claim 1. 
15 67. The method of claim 66, wherein the contacting step conq)rises at least one 

feature selected from the group consisting of immunization, restimulation, detection, and 
enumeration. 

68. The method of Claim 66, further comprising ELISPOT analysis, limiting dilution 
analysis, flow cytometry, in situ hybridization, the polymerase chain reaction or any combination 

20 thereof. 

69. A method of evaluating immunologic response, comprising the method of claim 66 
carried out prior to and subsequent to an mummization step. 

70. A method of evaluating immimologic refuse, comprising: 

deteraaining frequency, cytokine production, or cytolytic activity of T cells, prior 
25 to and subsequent to a step of stimulation with MHC-peptide complexes comprising the 

epitope of claim 1. 

71. A method of diagnosing a disease comprising: 

contacting a subject tissue with at least one component selected from the group 
consisting of the T cell of claim 40, the host cell of claim 50, the antibody of claim 36, the 
30 protein of claim 46; and 

diagnosing the disease based on a characteristic of the tissue or of the component. 

72. The method of claim 71, wherein the contacting step takes place in vivo, 

73. The method of claim 71, wherein the contacting step takes place in vitro. 

74. A method of making a vaccine, comprising: 

35 combining at least one component selected from the group consisting of the 

epitope of claim 1; the conq)osition of claim 18, 32, 45, or 53; the construct of claim 33; 
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the T cell of claim 40, and the host cell of claim 50, with a pharmaceutically acceptable 
adjuvant, carrier, diluent, or excipient. 

75. A computer readable medium having recorded thareon the sequence of any one of 
SEQ ID NOS: X -Y, in a machine having a hardware or software that calculates the physical, 
biochemical, immunologic, or molecular genetic properties of a molecule embodying said 
sequence. 

76. A method of treating an animal comprising combining the method of claim 55 
combined with at least one mode of treatment selected from the group of radiation therapy, 
chemotherapy, biochemotherapy, and surgery. 

77. An isolated polypeptide comprising an epitope cluster from a target-associated 
antigen having the sequence as disclosed in Tables 25-44, wherein the amino acid sequence 
consists of not more than about 80% of the amino acid sequence of the antigen. 

78. A vaccine or immunotherapeutic product comprising the polypeptide of claim 78. 

79. An isolated polynucleotide encoding the polypeptide of claim 78. 

80. A vaccine or immunotherapeutic product comprising the polynucleotide of claim 



82. 



81. 



The polynucleotide of claim 79 or 80, wherein the polynucleotide is DNA. 
The polynucleotide of claim 79 or 80, wherein the polynucleotide is RNA. 
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Graphs show lysis of unpulsed EL4 cells (open circles) and EL4 
cells pulsed with gp33 peptide (solid triangles). Symbols represent 
Individual nnlce and one of three similar experiments Is shown. 
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Days after tumor challenge 



Mean tumor volumes ± 1SD are shown for mice immunized 
with pEFGPL33A DNA (solid circles) or control pEGFP-N3 
DNA (open triangles). Numbers in brackets indicate number 
of mice with tumors/total number of mice in group. One 
of two similar experiments is shown. 
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